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PREFACE 


The beginner in experimentation too often finds himself supplied with 
a pair of elaborate mechanisms. In the one hand is a mass of data 
demanding simplification and interpretation, while in the other is a com- 
plex statistical methodology said to be necessary to research. How shall 
the two be geared together? Since the data can be only inefficiently 
utilized without statistical method, and since method is futile until applied 
to data, it seems strange that greater effort has not been made to unite 
the two. For those of some experience there are adequate texts and journal 
articles. It is the novice to whose needs this book is directed. It is hoped 
that he may be furnished with a smoothly working combination of experi- 
mental data and statistical method. 

Like all other sciences, statistics is in a stage of rapid evolution. During 
the last 20 years new discoveries have swiftly succeeded each other, fruit- 
ful syntheses have been effected, novel modes of thought have developed 
and a whole series of brand new statistical methods have been marketed. 
The biologist who has not been able to keep abreast of the progress of 
statistics finds himself a bit confused by the new ideas and technical 
terms. It is thought that he will welcome a statement of them in a form 
that will not require too much distraction of his attention from necessary 
professional duties. 

It is a fundamental belief of the author that statistical method can be 
used competently by scientists not especially trained in mathematics. 
The conditions surrounding the mathematical theorems can be set forth 
in terms quite readily understood by the lay reader. Since mastery of 
two sciences is possible for only few, it is necessary for most of us to advance 
by cooperation. To the mathematical statistician must be delegated the 
tasks of developing the theory and devising the methods, accompanying 
these latter by adequate statements of the limitations on their use. None 
but the biologist can decide whether the conditions are fulfilled in his 
experiments and interpret the results. The only mathematics used in this 
book is arithmetic, supplemented by enough symbolism to make the expo- 
sition intelligible. 

In the course of the development of each bit of scientific knowledge 
there comes a time when the experimental techniques must be questioned. 
Are they adequate to furnish the demanded precision of results? In what 
respects need they be improved? Is the most hopeful point of attack in 
the laboratory methods or in the experimental material? Fortunately, 
statistical methods supply answers, in many cases with little or no v extra 
labor in collecting data, provided only that slight but necessary modilica^ 
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tions be included in the plan of the experiment. Some of these tests of 
technique can be discussed even in this elementary presentation. 

Small sample methods are prerequisite in most biological data. For 
that reason, they are introduced at the start. The classical theory of 
large samples receives scant attention. In most places where it is mentioned 
at all it is introduced as a simplified special case of the small sample. 

The arrangement of the material in this text is not so much logical as 
developmental. The easiest ideas are put first, and only one new concept 
is presented at a time. The experienced reader will often feel a sense of 
inadequacy. It is believed that this will disappear as he continues, and 
that the inexperienced will be inducted with a minimum of difficulty. 
Numerous examples form an indispensable part of the presentation. In 
most of them the statistical method, with its meaning, is emphasized, 
the necessary drudgery of calculation being reduced to the lowest level. 

Certain diligent but misguided enthusiasts have brought down upon 
statistics the opprobrious description, “dry as dust.” Of course, one must 
take into consideration the point of view. Data on golf scores, operations, 
and babies are arid indeed to the listener, but of absorbing interest to the 
narrator. We have endeavored to present the subject in a different aspect. 
Fundamentally, statistics is a mode of thought. Biometrics is a delineation 
of living things. While the mechanism of description is always likely to be 
tedious, the effort has been made to emphasize the subject portrayed 
rather than the technique of the portrayal. 

Statistics at Iowa State College is a cooperative enterprise. In a sense 
the author is merely reporting the thinking of his colleagues. Their interest, 
advice and help have made possible the experience upon which this book 
is founded. Their generous contributions of experimental data and tech- 
nical knowledge will, if I have succeeded in interpreting them adequately, 
be helpful to others engaged in research. 

It is a pleasure to acknowledge the leadership of Professor R. A. Fisher. 
Even he who runs may read my appreciation of his unifying contribu- 
tions to statistics. By his residences as guest professor in mathematics at 
Iowa State College, as well as through his writings, he has exercised a 
profound influence on the experimental and statistical techniques of the 
institution. He and his publishers, Messrs. Oliver and Boyd of Edinburgh, 
have been liberal in permitting the use of tables of functions. 

My collaborators in the Statistical Laboratory have been unsparing 
of their help. To A. E. Brandt, Gertrude M. Cox, II. W. Norton and 
Mary L. Greenwood, I am indebted for valuable criticisms, suggestions, 
and computational assistance. 

George W. Snedegor 


Statistical Laboratory 
Iowa State College 
September, 1937 
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PREFACE TO THE THIRD EDITION 

I seize this opportunity of expressing my grateful appreciation of the 
friendly and helpful comments received from readers in all parts of the 
world. In this third printing I have corrected a number of errors that 
have been called to my attention. New material has been added at the 
end of chapters 6 and 16, while an additional chapter 17 has been written 
to incorporate methods useful in some of the broader fields of sampling. 

My colleague, Professor W. G. Cochran, has given me indispensable 
help and criticism in revising and extending the text. 

G. W. S. 

September, 1940 


PREFACE TO THE FOURTH EDITION 

In this edition the text has been largely rewritten, and the scope has 
been widened as follows: (i) greater emphasis has been placed on the 
theoretical conditions in which the various statistical methods have 
validity, and concurrently (ii) on the conduct of the experiment so as to 
incorporate in the data the information desired; (iii) estimates and fidu- 
cial statements have been brought into equal prominence with tests of 
hypotheses; (iv) there is increased reliance on experimental samplings to 
exemplify distribution theory; (v) the treatment of correlation and of 
experimental designs has been expanded; and (vi) the methods for dis- 
proportionate subclass numbers have been extended to include all those 
necessary for ordinary needs. 

As before, I have leaned heavily on my colleagues in the Statistical 
Laboratory and on other members of the Iowa State College staff. My 
indebtedness to them is gratefully acknowledged. 


January, 1946 


G. W. S. 
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INTRODUCTION 


There is a fascination in experimental work felt even by those who 
are not so fortunate as to follow it as a vocation. In searching the un- 
known for new truths there is mystery, there is adventure, there is the 
thrill of discovery. As most of us know, also, there are grinding routine 
and disappointment. Research is a hard mistress, but her rewards bring 
satisfactions that few of us would forego. 

The experiment is the distinctive tool of science. As old as man him- 
self, its earliest forms were no more than observations of the results fol- 
lowing an innovation, usually accidental. Attracted by a lush growth of 
grain, the primitive scientist remembered burying a fish at the spot. 
So may have begun the fertilization of crops. The story goes that, to 
frighten away thieves from the grapes in Medoc, a mixture of lime and 
copper sulfate was sprayed on the posts supporting the vines. A keen 
observer, Millardet, noticed that the nearby leaves were free from mildew 
and suggested spraying the vines with “Bordeaux Mixture.” These 
crude experiments were the prototypes of some of our modern demon- 
strations. You arc shown two animals in a cage. One is sleek, alert and 
active, while the other is scrawny and nearly blind, the result, you are 
told, of deficiency of vitamin A in the ration. Again, you visit a farm 
where two adjacent plots of acid soil were sown with clover. One plot, 
treated with lime and inoculated with nodule bacteria, has a uniform 
stand of vigorously growing clover while the other is occupied chiefly by 
weeds. When differences are so great, no more elaborate experiment is 
needed to prove the benefit of the treatment. 

Scarcely are the more obvious questions answered, however, when 
the less apparent ones begin to clamor for investigation. Which foods 
contain the best supplies of vitamins, and what rate of application of 
lime will yield the maximum returns? The solution of these problems 
requires experiments which, while becoming more complex, may be so 
planned as to become concurrently more efficient. 

The results of experiments are usually recorded in number symbols. 
The reduction of these data to summary forms, with the accompanying 
tests for reliability, gives rise to a branch of arithmetic called statistical 
method. Inevitably, the necessity for statistics and the intricacy of the 
methods are augmented by every advance toward more critical experi- 
mentation. An adequate experimental design involves not only a satis- 
factory plan for conducting the trial, but also an appropriate statistical 
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method for evaluating the results. The two can ill be divorced. United, 
they lead to conclusions that lend themselves to statement in categorical 
terms. 

To be prepared for research in biology, one should master the simpler 
techniques of experimental design. Not only must the many precautions 
be observed to insure the success of the laboratory or field work, but also 
the appropriate statistical method must be available for reducing and 
testing the data. The more elaborate designs may not be needed at first, 
or indeed ever. However, they are all at the command of the biologist 
who has a thorough knowledge of the fundamentals of statistical method. 



Chapter 1 

SAMPLING OF ATTRIBUTES 


1.1 — The problems. Sampling of public opinion enjoys at the moment 
a popularity and confidence that are well deserved. Less in the public 
eye but no less useful are numerous other practical samplings — devices 
for learning about large masses by observing a few individuals. A dozen 
rats, for example, may suffice to disclose useful facts concerning a popula- 
tion that runs into millions. A carload of coal is accepted or rejected on 
the evidence gained from testing only a few pounds. The physician makes 
inferences about a patient’s blood through examination of a single drop. 
This practice of sampling, so widespread and so fraught with conse- 
quences, deserves more than passing scrutiny. 

If correct conclusions are to be drawn, the sample must be representa- 
tive of the population. Keen thinking and long experience have yielded 
extensive knowledge of how samples should be gathered for maximum 
information. But despite the best efforts of the investigator, he always 
faces the possibility that the results from any one sample may be far from 
the true values in the aggregate. In sections to follow, you will be con- 
fronted by the rather disconcerting variability of samples drawn from a 
common source. Taking precautions to make them representative is, 
indeed, one part of the sampler’s obligation; but he must learn, also, as 
he makes his inferences to allow for ever-present sampling fluctuation. 
Here then are the twin problems that face the statistician: the collection 
of appropriate samples and the drawing of valid conclusions from them. 

Sampling may be far-flung, as in the effort to learn how an election 
is going; or it may be confined to a greenhouse bench. While most of this 
book is devoted to this latter, the experimental type, it is easier to start by 
giving some account of the former. 

1.2 — A sampling described. To focus attention, let us study a project 
undertaken in 1942, the objective being to estimate the effect of labor 
shortage on the production of pigs in eastern South Dakota during the 
spring of 1943. The sampling was done in this manner: using two ran- 
domly selected assessors’ books from each of 38 counties, we mailed a 
questionnaire to every fifth farmer in these lists. A sample of 1,000 
replies was chosen so as to give uniform representation to all sizes of 
farms. Of these 1,000 farmers, 43% indicated that extra help would not 
enable them to increase their farrowing of spring pigs. 

Let us examine this procedure in some detail. As you read on, you will 
find insistent emphasis on the necessity of randomness in sampling to 
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simplify the reaching of correct inferences about the population sampled, 
the population in this case being the 54,000 farmers in eastern South 
Dakota. What is a random sample? The dictionary defines it as one in 
which the selection of the individuals is left to chance. Some mechanical 
method is desirable, such as the drawing of numbered capsules from a 
container in which they are thoroughly mixed. Every capsule should 
have the same likelihood of being drawn; that is, there should be no 
favoritism or bias. 

from this description, you will see that our sample of farms is not 
simply random. In the first place, the assessors’ books were not taken 
entirely by chance, but two were drawn at random from each county. 
Such is called stratified random sampling , and will be discussed more at 
length in subsequent chapters. In the second place, we used a systematic 
scheme of taking the names from the books, every fifth being selected. 
This, though often convenient, may lead to non-random sampling if 
there is any order to the way the names are entered in the list. That is, 
if the entry is by community, or alphabetical, the systematic selection 
may not yield a random sample. On the other hand, if the first name is 
chosen by lot and every filth taken thereafter, then every farm has the 
same chance of appearing in the sample; and if there is no grouping of 
the names in the books by family, by community, or otherwise, the result- 
ing sample should be random. Finally, the sample of 1,000 now under 
discussion was systematically spread over all farm sizes because it was 
thought that size might be related to labor needs. This is another kind 
of stratification for which appropriate methods will be developed in 
chapter 6. You see, we have used some devices which do not violate the 
requirement of randomness and which increase the precision of the 
sampling, but you will have to wait to learn how this greater precision 
can be utilized. For the present we shall assume that we have a wholly 
random sample. 

1.3 — Inferences about the population. As noted before, 430 or 
43% of the sample of 1,000 farmers reported that they could not increase 
the spring farrowing of pigs with additional help. What inferences can 
be made about the sampled population of 54,000 farmers? The first one 
is this: of the fraction of farmers in eastern South Dakota who could not 
increase their pig crops even if extra labor were available, 43% is an 
unbiased estimate. In other words, the sample percentage, 43%, is an 
unbiased estimate of the population percentage. This inference is usually 
taken for granted. Most people pass from sample fact to inference about 
the population with scarcely a thought. Logically, the two concepts are 
distinct, and unless the sampling is random the inference may not be true. 

The second inference about the population is made by use of table 
1.1 (2) *. In the first part of the table, indicated by 95% in the heading, 


Reference 2 at the end of the chapter. 
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look across the top line to the sample size of 1,000, then down the column, 
X/n, to 43%. At the intersection of the line and column you will see the fig- 
ures 40 and 46. The second inference is that between 40% and 46% of the 
population of 54,000 farmers probably cannot utilize extra help to increase 
the spring farrowings. Thus we have set bounds to the inaccuracy of the 
sample estimate. First we said that the observed percentage is an unbiased 
estimate of the population value but gave no indication of the amount 
by which it might be in error. Now we assert the likelihood that the true 
percentage is not more than 3 either way from 43%. 

Since these concepts lie at the foundation of sampling theory, let us 
illustrate them in another fashion. Imagine a bin filled with beans, some 
white and some colored, thoroughly mixed. Dip out a scoopful of them 
at random, count the number of each color and calculate the percentage 
of white, say 24%. This is an estimate of the fraction of white beans in 
the bin. But how close an estimate is it? That’s where the second infer- 
ence comes in. If there were 250 beans in the scoop, we look at the table 
and say that the portion of white beans in the bin is probably between 
19% and 30% of the total. What is the nature of the probability involved 
in this statement? It is this: in this scoopful we may have happened upon 
an unusual concentration of white beans, or of colored, despite our 
efforts at complete mixing and random sampling. The table heading 
indicates the chance of a correct inference as 95 in 100 or, as it is often 
written, 19:1. This means that if the scooping is repeated many times, 
each time with a new estimate and a correspondingly new confidence 
interval , and if each time we use the table to make an appropriate confi- 
dence statement about the population percentage, we may expect to be 
right in about 19 of every 20 trials. 

Summarizing the farm labor sampling, we have estimated the propor- 
tion of farmers in eastern South Dakota not able to utilize extra help as 
43%, and we have said that the true percentage lies between 40% and 
46% unless a one-in-twenty chance has occurred in the sampling. If you 
feel unsafe in making inferences with a 5% probability of being wrong, 
you may use the second part of the table in which P = 0.99, setting the 
wider interval, 39% to 47%. If one says that the population percentage 
lies within these limits, he will be right unless a one-in-a-hundred chance 
has come off. 

You now realize why it is necessaiy to distinguish sharply the sample 
estimate, known exactly, from the population ratio, usually unknown but 
confidently asserted to lie within the tabled interval. The confusion of 
these two concepts lies at the bottom of a lot of muddy thinking about 
statistics. 

Another form of the inferences can be used with samples such as ours, 
drawn from a population whose size is known, 54,000 farms. Not only 
the estimate, 43%, but both of the confidence limits, 40% and 46%, may 
be expanded by multiplication, 43% of 54,000 = 23,200, etc. (Avoid the 
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TABLE 1.1 

95% Confidence Interval (Percent) for Binomial Distribution 


Number 

Observed 

X 

Size of Sample 

Fraction 

Observed 

X/n 

Size of Sample 

10 

15 

20 

30 

50 

100 

250 

1000 

0 

0 

31 

0 

22 

0 

17 

0 

12 

0 

07 

0 

4 

.00 

0 

1 

0 

0 

1 

0 

45 

0 

32 

0 

25 

0 

17 

0 

11 

0 

5 

.01 

0 

4 

0 

2 

2 

3 

56 

2 

40 

1 

31 

1 

22 

0 

14 

0 

7 

.02 

1 

5 

1 

3 

3 

7 

65 

4 

48 

3 

38 

2 

27 

1 

17 

1 

8 

.03 

1 

6 

2 

4 

4 

12 

74 

8 

55 

6 

44 

4 

31 

2 

19 

1 

10 

.04 

2 

7 

3 

5 

5 

19 

81 

12 

62 

9 

49 

6 

35 

3 

22 

2 

11 

.05 

3 

9 

4 

7 

6 

26 

88 

16 

68 

12 

54 

8 

39 

5 

24 

2 

12 

.06 

3 

10 

5 

8 

7 

35 

93 

21 

73 

15 

59 

10 

43 

6 

27 

3 

14 

07 

4 

11 

6 

9 

8 

44 

97 

27 

79 

19 

64 

12 

46 

7 

29 

4 

15 

08 

5 

12 

6 

10 

9 

55 

100 

32 

84 

23 

68 

15 

50 

9 

31 

4 

16 

.09 

6 

13 

7 

11 

10 

69 

100 

38 

88 

27 

73 

17 

53 

10 

34 

5 

18 

.10 

7 

14 

8 

12 

11 



45 

92 

32 

77 

20 

56 

12 

36 

5 

19 

.11 

7 

16 

9 

13 

12 



52 

96 

36 

81 

23 

60 

13 

38 

6 

20 

12 

8 

17 

10 

14 

13 



60 

98 

41 

85 

25 

63 

15 

41 

7 

21 

13 

9 

18 

11 

15 

14 



68 

100 

46 

88 

28 

66 

16 

43 

8 

22 

14 

10 

19 

12 

16 

15 



78 

100 

51 

91 

31 

69 

18 

44 

9 

24 

.15 

10 

20 

13 

17 

16 





56 

94 

34 

72 

20 

46 

9 

25 

.16 

11 

21 

14 

18 

17 





62 

97 

37 

75 

21 

48 

10 

26 

17 

12 

22 

15 

19 

18 





69 

99 

40 

77 

23 

50 

11 

27 

18 

13 

23 

16 

21 

19 





75 

100 

44 

80 

25 

53 

12 

28 

19 

14 

24 

17 

22 

20 





83 

100 

47 

83 

27 

55 

13 

29 

20 

15 

26 

18 

23 

21 







50 

85 

28 

57 

14 

30 

21 

16 

27 

19 

24 

22 







54 

88 

30 

59 

14 

31 

22 

17 

28 

19 

25 

23 







57 

90 

32 

61 

15 

32 

23 

18 

29 

20 

26 

24 







61 

92 

34 

63 

16 

33 

24 

19 

30 

21 

27 

25 







65 

94 
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35 
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31 

22 

28 
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96 
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38 









62 

87 

28 

48 

.38 

32 

44 

35 

41 

39 









64 

88 

29 

49 1 

39 

33 

45 

36 

42 

40 









66 

90 

30 

50 

.40 

34 

46 

37 

43 

41 









69 

91 

31 

51 i 

41 

35 

47 

38 

44 

42 









71 

93 

32 

52 

.42 

36 

48 

39 

45 

43 






. 



73 

94'33 

53| 

.43 

37 

49 

40 

46 

44 









|76 

95 

34 

54 

.44 

38 

50 

41 

47 

45 









78 

97 

35 

55 

.45 

39 

51 

42 

48 

46 









81 

08136 

56 

46 

40 

52 

43 

49 

47 









83 

99 

37 

57 

.47 

41 

53 

44 

50 

48 









86 

100 

38 

58 

48 

42 

54 

45 

51 

49 









89 

100 

39 

59 

49 

43 

55 

46 

52 

50 









93 

100 

40 

60 

50 

44 

56 

47 

53 










! 



* 





t 


* If A' exceeds 50, read 100 — X =• number observed and subtract each confidence 
limit from 100. 

t If X/n exceeds 0.50, read 1.00 — X/n = fraction observed and subtract each 
confidence limit from 100. 
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TABLE 1 . 1 (i continued ) 

99% Confidence Interval (per cent) for Binomial Distribution 


Number 

Observed 





Size of Sample 





Fraction 

Observed 

Size of Sample 

















X 

10 

15 

20 

30 

50 

100 

X/n 

250 

1000 

0 

0 

41 

0 

30 

0 

23 

0 

16 

0 

10 

0 

5 

.00 

o 

2 

0 

1 

1 

0 

54 

0 

40 

0 

32 

0 

22 

0 

14 

0 

7 

.01 

0 

5 

0 

2 

2 

i 

65 

1 

49 

1 

39 

0 

28 

0 

17 

0 

9 

02 

1 

6 

1 

3 

3 

4 

74 

2 

56 

2 

45 

1 

32 

1 

20 

0 

10 

03 

1 

7 

2 

4 

4 

8 

81 

5 

63 

4 

51 

3 

36 

1 

23 

1 

12 

.04 

2 

9 

3 

6 

5 

13 

87 

8 

69 

6 

56 

4 

40 

2 

26 

1 

13 

.05 

2 

10 

3 

7 

6 

19 

92 

12 

74 

8 

61 

6 

44 

3 

29 

2 

14 

06 

3 

11 

4 

8 

7 

26 

96 

16 

79 

11 

66 

8 

48 

4 

31 

2 

16 

07 

3 

13 

5 

9 

8 

35 

99 

21 

84 

15 

70 

10 

52 

6 

33 

3 

17 

.08 

4 

14 

6 

10 

9 

46 

100 

26 

88 

18 

74 

12 

55 

7 

36 

3 

18 

.09 

5 

15 

7 

12 

10 

59 

100 

31 

92 

22 

78 

14 

58 

8 

38 

4 

19 

10 

6 

16 

8 

13 

11 



37 

95 

26 

82 

16 

62 

10 

40 

4 

20 

11 

6 

17 

9 

14 

12 



44 

98 

30 

85 

18 

65 

11 

43 

5 

21 

12 

7 

18 

9 

15 

13 



51 

99 

34 

89 

21 

68 

12 

45 

6 

23 

.13 

8 

19 

10 

16 

14 



60 

100 

39 

92 

24 

71 

14 

47 

6 

24 

14 

9 

20 

11 

17 

15 



70 

100 

44 

94 

26 

74 

15 

49 

7 

26 

15 

9 

22 

12 

18 

16 





49 

96 

29 

76 

17 

51 

8 

27 

16 

10 

23 

13 

19 

17 





55 

98 

32 

79 

18 

53 

9 

29 

17 

11 

24 

14 

20 

18 





61 

99 

35 

82 

20 

55 

9 

30 

.18 

12 

25 

15 

21 

19 





68 

100 

38 

84 

21 

57 

10 

31 

19 

13 

26 

16 

22 

20 





77 

100 

42 

86 

23 

59 

11 

32 

20 

14 

27 

17 

23 

21 







45 

88 

24 

61 

12 

33 

21 

15 

28 

18 

24 

22 







48 

90 

26 

63 

12 

34 

.22 

16 

30 

19 

26 

23 







52 

92 

28 

65 

13 

35 

23 

17 

31 

20 

27 

24 







56 

94 

29 

67 

14 

36 

24 

18 

32 

21 

28 

25 







60 

96 

31 

69j 1 5 

38 

25 

18 

33 

22 

29 

26 







64 

97 

33 

71116 

39 

.26 1 

19 

34 

22 

30 

27 







68 

99 

35 

72 

16 

40 

.27 

20 

35 

23 

31 

28 







72 

100 

37 

74 

17 

41 

28 

21 

36 

24 

32 

29 







78 

100 

39 

76 

18 

42 

29 

22 

37 

25 

33 

30 







84 

100 

41 

77 

19 

43 

.30 

23 

38 

26 

34 

31 









43 

79 

20 

44 

.31 

24 

39 

27 

35 

32 









45 

80 

21 

45 

32 

25 

40 

28 

36 

33 








1 

47 

82 

21 

46 

33 

26 

41 

29 

37 

34 









49 

83 

22 

47 

34 

26 

42 

30 

38 

35 









51 

85 

23 

48 

.35 

27 

43 

31 

39 

36 









53 

86 

24 

49 

36 

28 

44 

32 

40 

37 









55 

88 

25 

50 

.37 

29 

45 

33 

41 

38 









57 

89 

26 

51 

38 

30 

46 

34 

42 

39 









60 

90 

27 

52 

.39 

31 

47 

35 

43 

40 









62 

92 

28 

53 

.40 

32 

48 

36 

44 

41 









64 

93 

29 

54 

.41 

33 

50 

37 

45 

42 









67 

94 

29 

55 

.42 

34 

51 

38 

46 

43 









69 

96 

30 

56 

43 

35 

52 

39 

47 

44 









71 

97 

31 

57 

.44 

36 

5340 

48 

45 









74 

98 

32 

58 

.45 

37 

54 

41 

49 

46 









77 

99 

33 

59 

.46 

38 

55 

42 

50 

47 









80 

99 

34 

60 

47 

39 

55 

43 

51 

48 









83 

100 

35 

61 

48 

40 

56 

44 

52 

49 









86 

100 

36 

62 

49 

41 

57 

45 

53 

50 









90 

100 

37 

63 

50 

42 

58 

46 

54 













♦ 



t 


t 


* If X exceeds 50, read 100 — X — number observed and subtract each confidence 
limit from 100, 

f If X/n exceeds 0.50, read 1.00 — X/n — fraction observed and subtract each 
confidence limit from 100. 



6 


Statistical Methods 


Section 1.3 


fallacy of carrying more significant figures than are justified by the factors 
multiplied. See section 5.6.) We may infer (i) that 23,200 is an unbiased 
estimate of the number of farmers in eastern South Dakota who cannot 
use extra labor to increase their pig enterprises, and (ii) that the real 
number lies between 21,600 and 24,800 unless a one-in-twenty chance 
has occurred in the sampling. It may be noted that corresponding infer- 
ences may be made about the alternative class, those farmers who could 
increase the pig crop if extra labor were available, the estimate being 
57% (expanded to 30,800 farmers) with confidence limits, 54% and 60%. 

In this brief prevue I have displayed a goodly portion of the wares 
that the statistician has to offer; the sampling of aggregates, examination 
of the facts discovered in the sample, and inference's based upon these 
facts. The remainder of the book is occupied chiefly with more and more 
intricate applications of these basic processes. Before going further, you 
may clarify your thinking by working out a few examples. 

You will find that examples form an essential part of my presentation of statistics. 
In each list they are graded so that you may start with the easier . It is suggested 
that a few in each group be worked after the first reading of the text , lesennng the 
more difficult until experience is enlarged . It is my firmly grounded belief that statistics 
cannot be masteied without this or similar practice. 

EXAMPLE 1 1 — Of a sample of 100 seeds drawn at random from a bag of uni- 
formly mixed bluegrass seeds, 92 germinated in a standard test 'I he seed analyst 
wishes to make a statement about the pioportion of seeds in the bag that will germinate 
under similar conditions, and he is willing to run the risk of only one per hundred that 
his sample has misled him. What may he say? Ans. Between 83% and 97% will 
germinate. Hint: look in the table for 100-92 — 8%. 

EXAMPLE 1 2 — If the seed analyst in the foregoing example had tested 1,000 
seeds instead of 100 and had found only 8% ungerminated, what statement might he 
have made with 99*1 odds of being right? Ans Between 90% and 94% will germinate. 

EXAMPLE 1.3 — An investigator making a nation-wide survey interviewed 
115 women over 40 years of age from the lower middle economic level in rural areas 
of middle western states Forty-six ol them had listened to a certain radio program 
three or more times during the preceding month Assuming random sampling, what 
statement can be made about the population fraction of such women listening, using 
the 95% confidence interval? Ans. Approximately, between 30 4% and 49 0% listen. 
You will have to interpolate these values in the table. 

EXAMPLE 1 4 — A sampler of public opinion asked 50 men to express their prefer- 
ences between candidates A and B. Twenty preferred A. Assuming random sampling 
fiom a population of 5,000, the sampler stated that between 1,150 and 2,950 men ol 
the population preferred A. What was the probability of his having been led into error 
by his sampling? Ans. 0.01. 

EXAMPLE 1.5 — In a random sample from a uniform population, 86%; expressed 
approval of a certain agricultural program. On the basis ol this sample the statistician 
asserted that unless a one-in-twenty chance had come off the percentage of those 
favorable in the sampled population was between 81% and 90%. What was the size 
of the sample 1 * Ans. 250. 

EXAMPLE 1.6 — If you guess that in a certain uniform stratum about half of the 
housewives own a specified appliance, and if you wish to survey a sample that will, at 
the 95% confidence level, yield an estimate differing by not more than 6 from the correct 
percentage, about how large a sample must you take? Ans. 250. 
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EXAMPLE 1.7 — If you draw a sample properly, then make a statement about the 
population ratio, the statement will be either light or wrong but you will not know 
which. Do you see clearly how chance enters into the making of the statement? 

EXAMPLE 1 .8 — After the survey of a certain underprivileged community it was 
reported (14) that among 978 children less than age 16, 54.4% were girls. Does the 
95% confidence interval include a reasonable sex ratio for this group? 

EXAMPLE 1 9 — The F 2 plants of a certain cross in maize segregated in the ratio 
118 normal green to 71 chlorophyll deficient. What common genetic ratios might be 
appropriate? 

1.4 — Sampling: definitions and discussions. Size of sample. I 

have rushed you along through the first three sections, thinking you might 
enjoy a quick glimpse into the ideas and utility of statistics. Some points 
need considerable clarification. A few of them will be discussed in this 
section, while others will come up later. 

The heading of this chapter is “Sampling of Attributes.” So far wc 
have considered such diverse attributes as shortage of farm labor, germi- 
nation of seed, approval of an agricultural program, and chlorophyll 
deficiency in plants. The possession or lack of an attribute distinguishes 
the two classes of individuals making up the populations we have been 
studying. The record of the sampling consists of the numbers of indi- 
viduals found to have or lack the attribute under investigation. I call 
these iecords enumeration data in distinction from records of measurements 
of such variables as height or age. The theory and methods for measurement 
data will be developed in chapters 2 and 3. 

One must learn to distinguish clearly between those elements which 
are certain and those which are matters of chance. The possession of an 
attribute by an individual is a fact that may be certainly determined, but 
the selection of this individual in a sample is a chance event. Often heard 
is some such inaccurate reporting as, “The probability that an Iowa 
family owns a car is 90%. ” An Iowa family either owns a car or doesn’t, 
the accurate phraseology being, “The probability is 90% that a randomly 
selected Iowa family shall be found to own a car.” It is the selection of 
the family that is the chance event, not the ownership of the car. This 
distinction is difficult in statements about the confidence interval. The 
population ratio either does or does not lie within any stated interval, 
irrespective of the sampling. Furthermore, there is no uncertainty about 
reading the table. The chance lies in the drawing of the sample. Selection 
of the last member of the sample ends all uncertainty about the truth or 
falsity of the inference that will be made, despite the fact that the sampler 
may never learn whether his statement has been right or wrong. The 
probability involved is the probability that the sample drawn lead to a 
correct conclusion. If you make a practice of sampling, and if for each 
sample you assert that the population ratio lies within the 0.95 confidence 
interval for that sample, an average of 95 per hundred of your assertions 
will be right. Note that this theory rests solidly on the randomness of the 
sampling. 
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# The term “unbiased’ * requires some further comment. An unbiased 
sample implies an unbiased estimate of the population ratio, an estimate 
that tends toward the true value as the size of the sampling increases. 
This feature of random sampling is reflected in the narrowing confidence 
interval that accompanies expansion of sample size. In sampling from 
certain symmetrical populations, it may be said that estimates made from 
unbiased samples are as likely to be in excess of the population value as in 
defect. It is clear that to say “unbiased” is not to imply “without sampling 
variation.” While working the examples you were doubtless impressed 
by the fact that an estimate from a small sample, even though unbiased, 
may be very inaccurate. The point is that in unbiased samples accuracy 
increases with size, but the same cannot be said about biased sampling. 

Randomness in sampling is perhaps never quite attained in practice. 
It is nevertheless the mathematical model on which much statistical 
theory rests, and since the theory must be used in drawing conclusions 
from work-a-day samplings, it is to the interest of the investigator to 
approximate, as closely as feasible, the ideal conditions. The better the 
approximation the more nearly correct will be the inferences drawn. 

There are two common devices used to approach randomness of sam- 
pling; the population itself may be thoroughly mixed, or else some scheme 
may be found for leaving to chance the choice of the individuals taken. 
Often a combination is used, as in the sampling of farmers from the 
assessors* books: apparently the listing of the names was random, but 
the precaution was taken of using every fifth. In the next section you will 
assume complete randomness in the lists and take the sample consecu- 
tively. But it is a matter of experience that most lists, as well as most 
natural populations, have groupings into families, communities, social 
and economic levels, etc., which require much care if the operation of 
chance is to be insured. Several available methods will be discussed later. 

The question most often asked by experimenters and other designers 
of samplings is, “How large a sample must I take?” The answers that may 
be given in various circumstances are not easy and always involve chance 
variation. Nevertheless, at appropriate places in this text (see Index) 
attempts will be made to clarify the problems involved. You doubtless 
noticed the method of attack proposed in example 1.6, where the confi- 
dence interval and a guess about the population percentage were proposed 
as data. Perhaps you noticed, also, that the method doesn’t work so well 
in the upper and left hand portions of the table; not only do the confidence 
limits change rapidly with the unknown Y, but they are not equally 
spaced above and below this fraction. Despite these shortcomings of the 
table, answers can usually be got from it sufficiently good for practical 
purposes. It is futile to attempt exact estimates of sample size because 
unpredictable sampling variation will upset the best laid plans. At the 
present stage my advice is to read the table as accurately as you may, then 
take a somewhat larger sample if you can afford it. 
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1.5 — An exercise in sampling. Whenever I draw samples from & 
known population I am amazed at the capricious way in which the items* 
turn up. It is a salutary discipline for an experimenter or enumerator 
occasionally to observe the laws of chance in action lest he become t$o 
confident of his professional samplings. Drawing marbles from an urn 
or cards from a pack, tossing coins and throwing dice are the classical 
methods of doing this. Heretofore I have asked my students to collect 
their samples by drawing marked or numbered beans from a bag, but 
the more modern method is to use a table of random numbers such as 
table 1.2. This table contains 10,000 digits jumbled together presumably 
in random fashion, the 5x5 blocks serving merely to facilitate reading. 
There are 100 rows and 100 columns, each numbered from 00 to 99. In- 
stead of examining peas to learn if they are wrinkled or smooth, or instead 
of asking people to vote “Yes” or “No,” you can draw a random sample 
of digits from this table and observe whether each is odd or even. Since 
the digits are supposed to be thoroughly mixed, with no particular order 
or groupings, any sequence of them may be considered random; so, begin 
at any point in the table and count the number of odd digits in a sample 
of some predetermined size. For example, suppose you decide on 50 
digits as the sample size, and elect to start at row 31, column 17. You find 

752 54450 19031 . . . 

Read on to the end of this row, then return to the beginning of row 32 and 
continue till 50 digits have been surveyed, ending with 

33851 44 

In this particular sample you will count 23 odd digits; that is, 46% of 
the sample of 50 arc odd. 

Three ways of introducing elements of chance into your sampling will 
decrease the likelihood of duplicating the drawings of others, (i) Start at 
a randomly selected point for each sample. A good way to do this is to 
place your pencil aimlessly on some digit in the table, then use this and 
the subsequent three digits to fix the initial point of your first sample. As 
an example, suppose your pencil point picks out 2 in row 80, column 84. 
Let this with the three digits following, 2061, specify row 20, column 61 
as the point to begin sample number one. For the second sample, start 
at a point fixed by the first four (or last four or any other convenient four) 
digits of the first one, and so on. (ii) Vary the direction in which you 
traverse the table; to the right or left, up or down, or diagonally, (iii) Vary 
the sample size, choosing in some random manner the various sizes, 
10, 15, . . . 250, heading table 1.1 (1,000 is unnecessarily large). 

For each sample make a record of the following facts: (i) initial point, 
(ii) direction taken in counting, (iii) sample size, and (iv) the number of 
odd digits. If you can spare the time, draw 100 or more samples, carefully 
preserving the records, in the order made, for future use. 
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TABLE 1.2 

Ten Thousand Randomly Assorted Digits 



00-04 

05-09 

10-14 

15-19 

20-24 

25-29 

30-34 

35-39 

40-44 

45-49 

00 

54463 

22662 

65905 

70639 

79365 

67382 

29085 

69831 

47058 

08186 

01 

15389 

85205 

18850 

39226 

42249 

90669 

96325 

23248 

60933 

26927 

02 

85941 

40756 

82414 

02015 

13858 

78030 

16269 

65978 

01385 

15345 

03 

61149 

69440 

11286 

88218 

58925 

03638 

52862 

62733 

33451 

77455 

04 

05219 

81619 

10651 

67079 

92511 

59888 

84502 

72095 

83463 

75577 

05 

41417 

98326 

87719 

92294 

46614 

50948 

64886 

20002 

97365 

30976 

06 

28357 

94070 

20652 

35774 

16249 

75019 

21145 

05217 

47286 

76305 

07 

17783 

00015 

10806 

83091 

91530 

36466 

39981 

62481 

49177 

75779 

08 

40950 

84820 

29881 

85966 

62800 

70326 

84740 

62660 

77379 

90279 

09 

82995 

64157 

66164 

41180 

10089 

41757 

78258 

96488 

88629 

37231 

10 

96754 

17676 

55659 

44105 

47361 

34833 

86679 

25930 

53249 

27083 

11 

34357 

88040 

53364 

71726 

45690 

66334 

60332 

22554 

90600 

71113 

12 

06318 

37403 

49927 

57715 

50423 

67372 

63116 

48888 

21505 

80182 

13 

62111 

52820 

07243 

79931 

89292 

84767 

85693 

73947 

22278 

11551 

14 

47534 

09243 

67879 

00544 

23410 

12740 

02540 

54440 

32949 

13401 

15 

98614 

75993 

84460 

62846 

59844 

14922 

48730 

73443 

48167 

34770 

16 

24856 

03648 

44898 

09351 

98795 

18644 

39765 

73058 

90368 

44104 

17 

96887 

12479 

80621 

66223 

86085 

78285 

02432 

53342 

42846 

94771 

18 

90801 

21472 

42815 

77408 

37390 

76766 

5263 5 

32141 

30268 

18106 

19 

55165 

77312 

83666 

36028 

28420 

70219 

81369 

41943 

47366 

41067 

20 

75884 

12952 

84318 

95108 

72305 

64620 

91318 

89872 

45375 

85436 

21 

16777 

37116 

58550 

42958 

21460 

43910 

01175 

87894 

81378 

10620 

22 

46230 

43877 

80207 

88877 

89380 

32992 

91380 

03164 

98656 

59337 

23 

42902 

66892 

46134 

01432 

94710 

23474 

20423 

60137 

60609 

13119 

24 

81007 

00333 

39693 

28039 

10154 

95425 

39220 

19774 

31782 

49037 

25 

68089 

01122 

51111 

72373 

06902 

74373 

96199 

97017 

41273 

21546 

26 

20411 

67081 

89950 

16944 

93054 

87687 

96693 

87236 

77054 

33848 

27 

58212 

13160 

06468 

15718 

82627 

76999 

05999 

58680 

96739 

63700 

28 

70577 

42866 

24969 

61210 

76046 

67699 

42054 

12696 

93758 

03283 

29 

94522 

74358 

71659 

62038 

79643 

79169 

44741 

05437 

39038 

13163 

30 

42626 

86819 

85651 

88678 

17401 

03252 

99547 

32404 

17918 

62880 

31 

16051 

33763 

57194 

16752 

54450 

19031 

58580 

47629 

54132 

60631 

32 

08244 

27647 

33851 

44705 

94211 

46716 

11738 

55784 

95374 

72655 

33 

59497 

04392 

09419 

89964 

51211 

04894 

72882 

17805 

21896 

83864 

34 

97155 

13428 

40293 

09985 

58434 

01412 

69124 

82171 

59058 

82859 

35 

98409 

66162 

95763 

47420 

20792 

61527 

20441 

39435 

11859 

41567 

36 

45476 

84882 

65109 

96597 

25930 

66790 

65706 

61203 

53634 

22557 

37 

89300 

69700 

50741 

30329 

11658 

23166 

05400 

66669 

48708 

03887 

38 

50051 

95137 

91631 

66315 

91428 

12275 

24816 

68091 

71710 

33258 

39 

31753 

85178 

31310 

89642 

98364 

02306 

24617 

09609 

83942 

22716 

40 

79152 

53829 

77250 

20190 

56535 

18760 

69942 

77448 

33278 

48805 

41 

44560 

38750 

83635 

56540 

64900 

42912 

13953 

79149 

18710 

68618 

42 

68328 

83378 

63369 

71381 

39564 

05615 

42451 

64559 

97501 

65747 

43 

46939 

38689 

58625 

08342 

30459 

85863 

20781 

09284 

26333 

91777 

44 

83544 

86141 

15707 

96256 

23068 

13782 

08467 

89469 

93842 

55349 

45 

91621 

00881 

04900 

54224 

46177 

55309 

17852 

27491 

89415 

23466 

46 

91896 

67126 

04151 

03795 

59077 

11848 

12630 

98375 

52068 

60142 

47 

55751 

62515 

21108 

80830 

02263 

29303 

37204 

96926 

30506 

09808 

48 

85156 

87689 

95493 

88842 

00664 

55017 

55539 

17771 

69448 

87530 

49 

07521 

56898 

12236 

60277 

39102 

62315 

12239 

07105 

11844 

01117 
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TABLE 1.2— (Continued) 



50-54 

55-59 

60-64 

65-69 

70-74 

75-79 

80-84 

85-89 

90-94 

95-99 

00 

59391 

58030 

52098 

82718 

87024 

82848 

04190 

96574 

90464 

29065 

01 

99567 

76364 

77204 

04615 

27062 

96621 

43918 

01896 

83991 

51141 

02 

10363 

97518 

51400 

25670 

98342 

61891 

27101 

37855 

06235 

33316 

03 

86859 

19558 

64432 

16706 

99612 

59798 

32803 

67708 

15297 

28612 

04 

11258 

24591 

36863 

55368 

31721 

94335 

34936 

02566 

80972 

08188 

05 

95068 

88628 

35911 

14530 

33020 

80428 

39936 

31855 

34334 

64865 

06 

54463 

47237 

73800 

91017 

36239 

71824 

83671 

39892 

60518 

37092 

07 

16874 

62677 

57412 

13215 

31389 

62233 

80827 

73917 

82802 

84420 

08 

92494 

63157 

76593 

91316 

03505 

72389 

96363 

52887 

01087 

66091 

09 

15669 

56689 

35682 

40844 

53256 

81872 

35213 

09840 

34471 

74441 

10 

99116 

75486 

84989 

23476 

52967 

67104 

39495 

39100 

17217 

74073 

11 

15696 

10703 

65178 

90637 

63110 

17622 

53988 

71087 

84148 

11670 

12 

97720 

15369 

51269 

69620 

03388 

13699 

33423 

67453 

43269 

56720 

13 

11666 

13841 

71681 

98000 

35979 

39719 

81899 

07449 

47985 

46967 

14 

71628 

73130 

78783 

75691 

41632 

09847 

61547 

18707 

85489 

69944 

15 

40501 

51089 

99943 

91843 

41995 

88931 

73631 

69361 

05375 

15417 

16 

22518 

55576 

98215 

82068 

10798 

86211 

36584 

67466 

69373 

40054 

17 

75112 

30485 

62173 

02132 

14878 

92879 

22281 

16783 

86352 

00077 

18 

80327 

02671 

98191 

84342 

90813 

49268 

95441 

15496 

20168 

09271 

19 

60251 

45548 

02146 

05597 

48228 

81366 

34598 

72856 

66762 

17002 

20 

57430 

82270 

10421 

05540 

43648 

75888 

66049 

21511 

47676 

33444 

21 

73528 

39559 

34434 

88596 

54086 

71693 

43132 

14414 

79949 

85193 

22 

25991 

65959 

70769 

64721 

86413 

33475 

42740 

06175 

82758 

66248 

23 

78388 

16638 

09134 

59880 

63806 

48472 

39318 

35434 

24057 

74739 

24 

12477 

09965 

96657 

57994 

59439 

76330 

24596 

77515 

09577 

91871 

25 

83266 

32883 

42451 

15579 

38155 

29793 

40914 

65990 

16255 

17777 

26 

76970 

80876 

10237 

39515 

79152 

74798 

39357 

09054 

73579 

92359 

27 

37074 

65198 

44785 

68624 

98336 

84481 

97610 

78735 

46703 

98265 

28 

83712 

06514 

30101 

78295 

54656 

85417 

43189 

60048 

72781 

72606 

29 

20287 

56862 

69727 

94443 

64936 

08366 

27227 

05158 

50326 

59566 

30 

74261 

32592 

86538 

27041 

65172 

85532 

07571 

80609 

39285 

65340 

31 

64081 

49863 

08478 

96001 

18888 

14810 

70545 

89755 

59064 

07210 

32 

05617 

75818 

47750 

67814 

29575 

10526 

66192 

44464 

27058 

40467 

33 

26793 

74951 

95466 

74307 

1 3330 

42664 

85515 

20632 

05497 

33625 

34 

65988 

72850 

48737 

54719 

52056 

01596 

03845 

35067 

03134 

70322 

35 

27366 

42271 

44300 

73399 

21105 

03280 

73457 

43093 

05192 

48657 

36 

56760 

10909 

98147 

34736 

33863 

95256 

12731 

66598 

50771 

83665 

37 

72880 

43338 

93643 

58904 

59543 

23943 

11231 

83268 

65938 

81581 

38 

77888 

38100 

03062 

58103 

47961 

83841 

25878 

23746 

55903 

44115 

39 

28440 

07819 

21580 

51459 

47971 

29882 

13990 

29226 

23608 

15873 

40 

63525 

94441 

77033 

12147 

51054 

49955 

58312 

76923 

96071 

05813 

41 

47606 

93410 

16359 

89033 

89696 

47231 

64498 

31776 

05383 

39902 

42 

52669 

45030 

96279 

14709 

52372 

87832 

02735 

50803 

72744 

88208 

43 

16738 

60159 

07425 

62369 

07515 

82721 

37875 

71153 

21315 

00132 

44 

59348 

11695 

45751 

15865 

74739 

05572 

32688 

20271 

65128 

14551 

45 

12900 

71775 

29845 

60774 

94924 

21810 

38636 

33717 

67598 

82521 

46 

75086 

23537 

49939 

33595 

13484 

97588 

28617 

17979 

70749 

35234 

47 

99495 

51434 

29181 

09993 

38190 

42553 

68922 

52125 

91077 

40197 

48 

26075 

31671 

45386 

36583 

93459 

48599 

52022 

41330 

60651 

91321 

49 

13636 

93596 

23377 

51133 

95126 

61496 

42474 

45141 

46660 

42338 
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TABLE 1 .2 — (Continued) 



00-04 

05-09 

10-14 

15-19 

20 -24 

25-29 

30-34 

35 -39 

40 * 44 

45-49 

50 

64249 

63664 

39652 

40646 

97306 

31741 

07294 

84149 

46797 

82487 

51 

26538 

44249 

04050 

48174 

65570 

44072 

40192 

51153 

11397 

58212 

52 

05845 

00512 

78630 

55328 

18116 

69296 

91705 

86224 

29503 

57071 

53 

74897 

68373 

67359 

51014 

33510 

83048 

17056 

72506 

82949 

54600 

54 

20872 

54570 

35017 

88132 

25730 

22626 

86723 

91691 

13191 

77212 

55 

31432 

96156 

89177 

75541 

81355 

24480 

77243 

76690 

42507 

84362 

56 

66890 

61505 

01240 

00660 

05873 

13568 

76082 

79172 

57913 

93448 

57 

48194 

57790 

79970 

33106 

86904 

48119 

52503 

24130 

72824 

21627 

58 

11303 

87118 

81471 

52936 

08555 

28420 

49416 

44448 

04269 

27029 

59 

54374 

57325 

16947 

45356 

78371 

10563 

97191 

53798 

12693 

27928 

60 

64852 

34421 

61046 

90849 

13966 

39810 

42699 

21753 

76192 

10508 

61 

16309 

20384 

09491 

91588 

97720 

89846 

30376 

76970 

23063 

35894 

62 

42587 

37065 

24526 

72602 

57589 

98131 

37292 

05967 

26002 

51945 

63 

40177 

98590 

97161 

41682 

84533 

67588 

62036 

49967 

01990 

72308 

64 

82309 

76128 

93965 

26743 

24141 

04838 

40254 

26065 

07938 

76236 

65 

79788 

68243 

59732 

04257 

27084 

14743 

17520 

95401 

55811 

76099 

66 

40538 

79000 

89559 

25026 

42274 

23489 

34502 

75508 

06059 

86682 

67 

64016 

73598 

18609 

73150 

62463 

33102 

45205 

87440 

96767 

67042 

68 

49767 

12691 

17903 

93871 

99721 

79109 

09425 

26904 

07419 

76013 

69 

76974 

55108 

29795 

08404 

82684 

00497 

51126 

79935 

57450 

55671 

70 

23854 

08480 

85983 

96025 

50117 

64610 

99425 

62291 

86943 

21541 

71 

68973 

70551 

25098 

78033 

98573 

79848 

31778 

29555 

61446 

2 3037 

72 

36444 

93600 

65350 

14971 

25325 

00427 

52073 

64280 

18847 

24768 

73 

03003 

87800 

07391 

11594 

21196 

00781 

32550 

57158 

58887 

73041 

74 

17540 

26188 

36647 

78386 

04558 

61463 

57842 

90382 

77019 

24210 

75 

38916 

55809 

47982 

41968 

69760 

79422 

80154 

91486 

19180 

15100 

76 

64288 

1 9843 

69122 

42502 

48508 

28820 

59933 

72998 

99942 

10515 

77 

86809 

51564 

38040 

39418 

49915 

19000 

58050 

16899 

79952 

57849 

78 

99800 

99566 

14742 

05028 

30033 

94889 

53381 

23656 

75787 

59223 

79 

92345 

31890 

95712 

08279 

91794 

94068 

49337 

88674 

35355 

12267 

80 

90363 

65162 

32245 

82279 

79256 

80834 

06088 

99462 

56705 

06118 

81 

64437 

32242 

48431 

04835 

39070 

59702 

31508 

60935 

22390 

52246 

82 

91714 

53662 

28373 

34333 

55791 

74758 

51144 

18827 

10704 

76803 

83 

20902 

17646 

31391 

31459 

33315 

03444 

55743 

74701 

58851 

27427 

84 

12217 

86007 

70371 

52281 

14510 

76094 

96579 

54853 

78339 

20839 

85 

45177 

02863 

42307 

53571 

22532 

74921 

17735 

42201 

80540 

54721 

86 

28325 

90814 

08804 

52746 

47913 

54577 

47525 

77705 

95330 

21866 

87 

29019 

28776 

56116 

54791 

64604 

08815 

46049 

71186 

34650 

14994 

88 

84979 

81353 

56219 

67062 

26146 

82567 

33122 

14124 

46240 

92973 

89 

50371 

26347 

48513 

63915 

11158 

25563 

91915 

18431 

92978 

11591 

90 

53422 

06825 

69711 

67950 

64716 

18003 

49581 

45378 

99878 

61130 

91 

67453 

35651 

89316 

41620 

32048 

70225 

47597 

33137 

31443 

51445 

92 

07294 

85353 

74819 

23445 

68237 

07202 

99515 

62282 

53809 

26685 

93 

79544 

00302 

45338 

16015 

66613 

88968 

14595 

63836 

77716 

79596 

94 

64144 

85442 

82060 

46471 

24162 

39500 

87351 

36637 

42833 

71875 

95 

90919 

11883 

58318 

00042 

52402 

28210 

34075 

33272 

00840 

73268 

96 

06670 

57353 

86275 

92276 

77591 

46924 

60839 

55437 

03183 

13191 

97 

36634 

93976 

52062 

83678 

41256 

60948 

18685 

48992 

19462 

96062 

98 

75101 

72891 

85745 

67106 

26010 

62107 

60885 

37503 

55461 

71213 

99 

05112 

71222 

72654 

51583 

05228 

62056 

57390 

42746 

39272 

96659 
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TABLE 1.2 — ( Continued ) 



50-54 

55-59 

60-64 

65-69 

70-74 

75-79 

80-84 

85-89 

90-94 

95-99 

50 

32847 

31282 

03345 

89593 

69214 

70381 

78285 

20054 

91018 

16742 

51 

16916 

00041 

30236 

55023 

14253 

76582 

12092 

86533 

92426 

37655 

52 

66176 

34047 

21005 

27137 

03191 

48970 

64625 

22394 

39622 

79085 

53 

46299 

13335 

12180 

16861 

38043 

59292 

62675 

63631 

37020 

78195 

54 

22847 

47839 

45385 

23289 

47526 

54098 

45683 

55849 

51575 

64689 

55 

41851 

54160 

92320 

69936 

34803 

92479 

33399 

71160 

64777 

83378 

56 

28444 

59497 

91586 

95917 

68553 

28639 

06455 

34174 

11130 

91994 

57 

47520 

62378 

98855 

83174 

13088 

16561 

68559 

26679 

06238 

51254 

58 

34978 

63271 

13142 

82681 

05271 

08822 

06490 

44984 

49307 

62717 

59 

37404 

80416 

69035 

92980 

49486 

74378 

75610 

74976 

70056 

15478 

60 

32400 

65482 

52099 

53676 

74648 

94148 

65095 

69597 

52771 

71551 

61 

89262 

86332 

51718 

70663 

11623 

29834 

79820 

73002 

84886 

03591 

62 

86866 

09127 

98021 

03871 

27789 

58444 

44832 

36505 

40672 

30180 

63 

90814 

14833 

08759 

74645 

05046 

94056 

99094 

65091 

32663 

73040 

64 

19192 

82756 

20553 

58446 

55376 

88914 

75096 

26119 

83898 

43816 

65 

77585 

52593 

56612 

95766 

10019 

29531 

73064 

20953 

53523 

58136 

66 

23757 

16364 

05096 

03192 

62386 

45389 

85332 

18877 

55710 

96459 

67 

45989 

96257 

23850 

26216 

23309 

21526 

07425 

50254 

19455 

29315 

68 

92970 

94243 

07316 

41467 

64837 

52406 

25225 

51553 

31220 

14032 

69 

74346 

59596 

40088 

98176 

17896 

86900 

20249 

77753 

19099 

48885 

70 

87646 

41309 

27636 

45153 

29988 

94770 

07255 

70908 

05340 

99751 

71 

50099 

71038 

45146 

06146 

55211 

99429 

43169 

66259 

97786 

59180 

72 

10127 

46900 

64984 

75348 

04115 

33624 

68774 

60013 

35515 

62556 

73 

67995 

81977 

18984 

64091 

02785 

27762 

42529 

97144 

80407 

64524 

74 

26304 

80217 

84934 

82657 

69291 

35397 

98714 

35104 

08187 

48109 

75 

81994 

41070 

56642 

64091 

31229 

02595 

13513 

45148 

78722 

30144 

76 

59537 

34662 

79631 

89403 

65212 

09975 

06118 

86197 

58208 

16162 

77 

51228 

10937 

62396 

81460 

47331 

91403 

95007 

06047 

16846 

64809 

78 

31089 

37995 

29577 

07828 

42272 

54016 

21950 

86192 

99046 

84864 

79 

38207 

97938 

93459 

75174 

79460 

55436 

57206 

87644 

21296 

43395 

80 

88666 

31142 

09474 

89712 

63153 

62333 

42212 

06140 

42594 

43671 

81 

53365 

56134 

67582 

92557 

89520 

33452 

05134 

70628 

27612 

33738 

82 

89807 

74530 

38004 

90102 

11693 

90257 

05500 

79920 

62700 

43325 

83 

18682 

81038 

85662 

90915 

91631 

22223 

91588 

80774 

07716 

12548 

84 

63571 

32579 

63942 

25371 

09234 

94592 

98475 

76884 

37635 

33608 

85 

68927 

56492 

67799 

95398 

77642 

54913 

91853 

08424 

81450 

76229 

86 

56401 

63186 

39389 

88798 

31356 

89235 

97036 

32341 

33292 

73757 

87 

24333 

95603 

02359 

72942 

46287 

95382 

08452 

6286 2 

97869 

71775 

88 

17025 

84202 

95199 

62272 

06366 

16175 

97577 

99304 

41587 

03686 

89 

% 

02804 

08253 

52133 

20224 

68034 

50865 

57868 

22343 

55111 

03607 

90 

08298 

03879 

20995 

19850 

73090 

13191 

18963 

82244 

78479 

99121 

91 

59883 

01785 

82403 

96062 

03785 

03488 

12970 

64896 

38336 

30030 

92 

46982 

06682 

62864 

91837 

74021 

89094 

39952 

641 58 

79614 

78235 

93 

31121 

47266 

07601 

02051 

67599 

24171 

69843 

83696 

71402 

76287 

94 

97867 

56641 

63416 

17577 

30161 

87320 

37752 

73276 

48969 

41915 

95 

57364 

86746 

08415 

14621 

49430 

22311 

15836 

72492 

49372 

44103 

96 

09559 

26263 

69511 

28064 

75999 

44540 

13337 

10918 

79846 

54809 

97 

53873 

55571 

00608 

42661 

91332 

63956 

74087 

59008 

47493 

99581 

98 

35531 

19162 

86406 

05299 

77511 

24311 

57257 

22826 

77555 

05941 

99 

28229 

88629 

25695 

94932 

30721 

16197 

78742 

34974 

97528 

45447 
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Section 1.5 


You have now done experimentally what the mathematical statistician 
does theoretically when he studies the distribution of samples drawn at 
random from a specified population, and you can verify the results which 
he derives from his equations. First count the number of samples whose 
percentage of odd digits is greater than 50.6%, which is the exact fraction 
of odds in the population of 10,000. Approximately half of your percentages 
should be above this population value and half below. Do you find much 
sampling deviation from half-and-half? If you have 100 samples, you 
need not be surprised if 45 percentages are on one side of 50.6% and 55 
on the other. The second point to observe is a characteristic of nearly all 
sampling which should be rather obvious in yours; that is, the tendency 
of the sample percentages to concentrate near the population 50.6%. 
How many of yours were between say, 45% and 55%? This central 
tendency is an idea which should counterbalance the perhaps discourag- 
ing fact that some samples are notably divergent. 

Next, you have evidence of the unbiased nature of the sampling. Look 
over the percentages in all your samples of 10 and note the greatest devia- 
tion from 50.6%. Do the same with your samples of 15, 20, etc. The 
larger samples should not have such pronounced deviations from 50.6% 
as are notable in some of the smaller. This verifies the fact that as the 
sample size increases its average tends towards the population value; not 
invariably, indeed, but with constantly increasing probability of small 
deviations. 

Finally, you can verify the theory of the confidence interval. From 
table 1.1, write down the 95% interval for each of your samples. Thus, 
for the sample of 50 discussed above, 23 of the digits were odd. This 
leads to the statement that the population percentage lies between 32% 
and 61%, a correct conclusion because we know that 50 6% of the digits 
in the table arc odd. But suppose one of your samples of 250 had started 
at row 85, column 23, you would have counted, moving down the suc- 
cessive columns, only 101 or 40.4% odd, and would have asserted that 
the true value is between 34% and 46%. You would have been wrong 
despite the fact that this sample is randomly drawn from the same popula- 
tion as the others. It merely happens to be unusually divergent. You 
should find about five samples in a hundred leading you to incorrect 
statements, but there will be no occasion for surprise if only three, or as 
many as seven, turn up. If you now examine the intervals for P — 0.99, 
you may expect (allowing as usual for sampling variation) about 99 per 
100 of these to include the true ratio of 50.6%. I hope your results may 
be sufficiently concordant with theory to give you confidence in it. Of 
one thing I am sure — you are more aware of the vagaries of sampling 
than you were before, and that is one of the objects of the experiment. 
Another lesson to be learned is that only broad confidence intervals can 
be based on small samples, and that even so the inference can be wrong. 
Also, one finds that refinement of the statements increases but slowly with 
sample size. 
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Let me emphasize the point that failure to make correct inferences in 
a small portion of samples is not a fault that can be remedied but is 
inevitably bound up in the sampling procedure. Fallibility is the very 
nature of such evidence. The sampler can only take available precau- 
tions, then prepare himself for his share of mistakes. In this he is not 
alone. The journalist, the judge, the banker, the weather forecaster — 
these and most other people are subject to the laws of chance, and each 
makes his own quota of wrong guesses. The statistician has this advan- 
tage — he can, in favorable circumstances, know exactly his likelihood of 
error. 


EXAMPLE 1.10 -If you wish to learn by sampling what portion of radio owners 
in a selected region listen to a certain program, would you get an unbiased sample 
if you conducted the interviews by telephone? Would all radio owners have the same 
chance of being interviewed? 

EXAMPLE 1 1 1- A sampler of public opinion estimates from a sample the number 
of voters in a state favoring a certain candidate for governor. Assuming that his 
estimate was very close to the population value at the time the survey was made, 
might the ballot on election day be quite different? 

EXAMPLE M2 — Why does a mailed questionnaire often result in a biased esti- 
mate? 


EXAMPLE 1 13— Combine all your samples of digits, irrespective of size, into 
one I he correct percentage of odd digits can be got in two ways: (i) add all the 
counts of odd digits and divide by the total digits counted; (li) compute the weighted 
average of the percentages of the various sized samples, using the formula, 


weighted average = - 


Wyp± + T W tpj 4- . . 

Wi + Ifj -f II i + . . . 


where p\ is the percentage of odds in the samples of 10 and is the total number of 
digits counted in these samples. p>i and are defined in the same way for samples of 
15, etc. 

EXAMPLE 1 14— If you have 100 swine to be distributed at random into 10 lots’ 
how can the allotment be made by means of table 1 2? 

1.6 — Hypotheses about populations. In contrast with the foregoing, 
the investigator often has in mind a definite hypothesis about the popu- 
lation ratio, the purpose of the sampling being to get evidence concerning 
his hypothesis. Thus, a geneticist studying heredity in the tomato had 
reason to believe that, in the F% generation, segregation of fruits with 
red flesh and yellow flesh would be in the ratio, 3:1 . Counting the number 
of fruits with red flesh in a sample of 400 he found 310 instead of the ex- 
pected 300. With your experience of sampling variation, would you 
accept this as verification or refutation of the hypothesis? Again, the 
sampler of public opinion foresees a close election in some critical state. 
He sets up the hypothesis of an equal number of votes for the two domi- 
nant parties, then asks 10,000 voters their preferences, the resulting divi- 
sion being 5,100 to 4,900. Shall he say that this is or is not convincing 
evidence about his hypothesis of a tie vote? 

To answer such questions two data are needed, a measure of the 
deviation of the sample from the hypothetical population ratio, and a 
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means of judging whether or not this measure is an amount commonly 
expected in sampling or, on the contrary, is so great as to throw doubt 
upon the hypothesis. Both data were furnished by Karl Pearson in 1899 
(11). He devised an index of dispersion denoted by x 2 (chi-square), 
and provided a table of its probable occurrence in sampling.* Let us 
first examine the index. 

1.7 — Chi-square, an index of dispersion. Naturally, the deviations 
of the observed numbers from those specified by the hypothesis form the 
basis of the index. In the election sampling of the previous section, the 
numbers expected on the assumption of an equally divided vote arc each 
5,000. The deviations, then, are 

5,100 - 5,000 = 100 

and 

4,900 - 5,000 - - 100, 

the sum of the two being zero. The value of chi-square is given by, 

, = _(1 00)’ (-loop = 

x 5,000 5,000 

Each deviation is squared, each square is divided by the expected num- 
ber, and the results are added. The expected numbers appear in the 
denominators in order to introduce sample size into the quantity — it is 
the relative size of the deviation which is important. The squaring may 
puzzle you. You will find this a very common practice in statistics. 
Incidentally, it makes the sign of the deviation unimportant since the 
square of a negative number is the same as that of the corresponding 
positive. It is plain that chi-square would be zero if the sample totals 
were the same as those expected, and that it will increase with increasing 
deviation from expected. In that respect, it is a reasonable measure of 
variation. But it is not at all plain whether the chi-square value of 4 
is to be considered large, medium, or small. To furnish you a basis for 
comparison is our next aim. Pearson founded his judgment on the solu- 
tion and tabulation of a somewhat elaborate equation, but for non- 
mathematicians the customary way of going about the matter is to set 
up an experiment. Before doing that, a couple of formulas will be given, 
together with a few examples to help fix them in mind. 

1.8 — Symbolical statement of chi-square. It is convenient to repre- 
sent by X\ and X 2 the sample counts of individuals who do and do not 
possess the attribute being investigated, the corresponding expected 
numbers being mi and m 2 . The two deviations, then, are X\ — mi and 
X 2 m 2 , so that chi-square is given by the formula, 

X 2 = (Xi — mi) 2 /mi + (A r 2 ~ m 2 ) 2 /m 2 

* G. U. Yule and R. A. Fisher discovered an error in certain uses of the table and 
corrected it in 1922 (17) (4). 
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The formula may be condensed to the more easily remembered as well 
as more general one, 

x 2 = S(X - m) 2 /m, 


where S denotes summation. In words, “ Chi-square is the sum of such 
ratios as 

(deviation square)/(cxpected number)” 

As will appear later, there may be more than two of these ratios in the 
summation. 

Let us apply the formula to the counts of red and yellow tomatoes in 
section 1.6. There, X\ = 310, Xi = 400 — 310 = 90, m\ = % of 400 
= 300, and m 2 = ^ of 400 = 100. Whence, 

2 __ (310 - 3 00) 2 (90 - 100) 2 __ 

x '"300 + ’ 100 

It is apparent that the hypothesis set up about the population is the 
sole criterion for dividing the sample into expected numbers. Only if the 
investigator wishes to test some hypothesis will he be interested in calcu- 
lating chi-square. 

EXAMPLE 1.15 -“In a certain cross of two varieties of peas genetic theory led the 
investigator to expect half wrinkled seeds and half smooth. In a sample of 800 seeds 
examined he found 440 wrinkled. What is the value of chi-square-* Ans 8. 

EXAMPLE 116 —If the count in the foregoing example had been 220 wrinkled in 
a total of 400, would chi-square also be half its original valued 

EXAMPLE 1.17 — In the text example about tomatoes the deviation from expected 
was 10. If this same deviation had occurred in a sample of twice the size (that is, in a 
sample of 800), what would have been the value of chi-square^ Ans. 0.67, half the 
original value. 

EXAMPLE 1 18 —In a political campaign a candidate claimed that 60% of the 
electorate would vote for him. A sampler of public opinion asked 1,000 registered 
voters if they expected to vote for this candidate and 55% of them said, “Yes.” Calcu- 
late the value of chi-square on the assumption that the candidate’s claim was correct. 
Ans 10.42 

EXAMPLE 1 10 —In the foregoing example, what statement about the electorate 
may the sampler make based on the 99% confidence interval? Ans. Between 51 % and 
59% will vote for the candidate. 

1.9— An experiment in sampling chi-square. You have now had 
some practice in the calculation of chi-square. It is to be hoped that it 
appeals to you as a reasonable and informative measure of the deviation 
of the observed sample numbers from those expected under the hypothesis 
set up. But it has another function — it enables one to judge whether the 
sample ratio itself departs much or little from the hypothetical population 
value. To use it for that purpose we must answer the question already 
proposed: what values of chi-square are to be considered as indicating 
unusual deviation, and what as ordinary sampling variation? Our experi- 
mental method of answering the question will be to calculate chi-square 
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for each of the samples you drew from the table of random numbers, then 
observe what values of chi-square spring from the more unusual samples. 
If a large number of samples of various sizes have been drawn, and if the 
value of chi-square is computed from each, the distribution of chi-square 
may be mapped. 

To illustrate the calculation of chi-square for one of your samples 
suppose the sample size is n = 100; then, since 50.6% of the tabular 
digits are odd, mi — (0.506) (100) = 50.6 and = 49.4. If you counted 
A\ = 44 odd digits in this sample, the value of chi-square is 

2 = (44 - W (56 - 49.4)2 = 

50.6 49.4 

In case there are several samples of this same size, calculating time 
can be saved by rearranging the formula. Remembering that the two 
deviations are the same in absolute value (that is, disregarding sign), this 
absolute value being 6.6 for each deviation, chi-square can be written 

x 2 = (A" — m) 2 (l/m! + 1 /m 2 ) 

= (6.6) 2 (1 / 50.6 + 1/49.4) 

= (43.56)(0.019763 + 0 020243) 

= (43.56) (0.040006) = 1.74 

as before. The advantage is that once the sum of the reciprocals of the 
expected numbers has been calculated, it can be used again and again 
so long as the sample size and the population ratio are unchanged: just 
square the deviation from expected and multiply by 0.040006. Of course, 
an appropriate factor must be calculated for each sample size. 

Those who do not have adequate computing facilities (table of squares, 
slide rule, calculating machine) may prefer to sacrifice some precision 
to save a good deal in the time required for the experiment just outlined. 
If they assume 50% of the tabular digits to be odd, then m i = m<i = nj 2, 
an integer in most of the samples. For n = 100, the formula above then 
becomes 

X 2 = (0.04) (X - 5 0) 2 

The effect on the frequency distribution may not be noticed, but the 
individual values of chi-square in large samples are obviously different. 

Proceed now to calculate chi-square for each of your samples. The 
results may be summarized conveniently, especially if there are a hundred 
or more of them, in tables and a graph to be described in the next sections. 

1.10 — The frequency distribution. Table 1.3 is a typical frequency 
distribution , or merely, distribution. Class intervals (0.00 to 0.49, etc.) of 
convenient size are chosen, then the values falling into each are counted 
(in this table the values are those of chi-square). The counts are called 
class frequencies. You may find it more convenient to halve the number of 
classes for your distribution, making the first extend from zero to 0.99. 
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TABLE 1.3 

Distribution of 230 Values of Chi-Square Calculated From Samples Drawn 

From Table 1.2 

Sample sizes — 10, 15, 20, 30, 50, 100, and 250 


Class Interval 

Frequency 

Class Interval 

Frequency 

0 

00-0 

49 

116 

6 

00- 

6 

.49 

0 

0 

50-0 

99 

39 

6 

50- 

6 

99 

1 

1 

00-1 

49 

18 

7 

00- 

7 

49 

0 

1 

50-1 

99 

22 

7 

50- 

7 

99 

0 

2 

.00-2 

49 

12 

8 

00- 

8, 

.49 

0 

2 

50-2 

99 

5 

8 

50- 

8 

99 

1 

3 

00-3 

49 

5 

9 

00- 

9 

49 

0 

3 

50-3 

99 

6 

9 

50- 

9 

99 

0 

4 

00-4 

49 

1 

10 

00- 

10 

49 

1 

4 

50-4 

99 

2 

10 

50- 

10 

99 

0 

5 

00 5 

49 

0 

11. 

00- 

11 

49 

1 

5 

50-5 

99 

0 







For more detailed information about the structure of statistical tables, 
see reference (15). 

Observe the concentration of sample chi-squares in the smallest class, 
practically half of them being less than 0.5. Attention has already been 
called to the fact that in the process of sampling, small deviations are pre- 
dominant. This tendency of sample ratios to deviate little from the 
population value is the foundation of confidence in sampling. But taking 
a less optimistic view, one must not overlook the samples with large devia- 
tions and chi-squares. The possibility of getting such as these makes for 
caution in drawing conclusions. In this sampling exercise we know the 
population ratio and are not led astray by discrepant samples, but in 
actual investigations, where the hypothesis set up is not known to be the 
right one, a large value of chi-square constitutes a dilemma. Shall we 
say that it denotes only an unusual sample from the hypothetical popula- 
tion, or shall we conclude that the hypothesis misrepresents the true popu- 
lation ratio? Statistical theory contains no certain answer. Instead, it 
furnishes an evaluation of the probability of possible sample deviations 
from the hypothetical population. If chi-square is large, the investigator 
is warned that the sample is an improbable one under his hypothesis. 
This is evidence to be added to that which he already possesses, all of it 
being the basis for his decisions. A more exact determination of prob- 
ability will be explained in the second section to follow. 

1.11 — Graphical representation. You will no doubt wish to represent 
your sample chi-squares in a graph such as figure 1.1. On the horizontal 
axis mark off intervals corresponding to the classes of the frequency 
distribution. Upon each construct a rectangle with altitude proportional 
to the corresponding class frequency, using the scale of frequencies at the 
left. This kind of graphical representation of the frequency distribution 
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CHI -SQUARE 

Fio. 1 l -Histogram representing frequency distribution of the 230 sample values 

of chi-square in table 1.3. 

is known as a histogram. The graph serves to emphasize both the concen- 
tration of small chi-squares at the left and the comparatively large size 
of the few at the right. Reference (1) contains excellent instructions for 
the making of graphs. 

1.12 — Frequency distribution with unequal class intervals. Two 

features of our chi-square distribution are yet to be examined: (i) how 
does it compare with the theoretical distribution, and (li) how may we 
evaluate moie exactly the probabilities of various chi-square sizes? Both 
features become evident if the class intervals are rearranged as in table 1.4, 
column 1. Very narrow at the start, these theoretically determined inter- 
vals increase in width so as to make the chi-square distribution symmetrical 
(5,6) as in column 4. Furthermore, they arc adjusted so that eac h contains 
a convenient percentage of chi-square values; in other words, so that the 
probabilities of specific values arc easily read. As an example', the last 
line of column 4 indicates the probability of 0.01 that, in random sampling, 
chi-square be larger than 6.635. 

The distribution of our sample of 230 chi-squares in columns 2 and 3 
is readily compared with the theoretical distribution of column 4. Clearly 
the two do not differ greatly. The discrepancies are due partly to sampling 
variation, but mainly to this fact: the theoretical distributions in columns 
4 and 5 are based on a ‘‘continuous” variable, one capable of taking all 
numerical values, while those in columns 2 and 3 arise from counting a 
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TABLE 1.4 

Frequency Distributions of Chi-Square With Unequal Class Intervals 


1 

2 

3 

4 

5 


Discontinuous Variable 

Continuous Variable 





Theoretical 


Distribution 

Percentage 

Theoretical 

Cumulative 


of 230 Sample 

Distribution 

Percentage 

Percentage 

Class Interval i 

Values 

of Column 2* 

Distribution 

Distribution 

0 -0 000157 

0 

0 

1 

100 

0 000157-0 000628 

0 

0 

1 

99 

0 000628-0 00393 

11 

4 78 

3 

98 

0 00393 -0 0158 

20 

8 70 

5 

95 

0 0158 -0.0642 

10 

4 35 

10 

90 

0 0642 -0 148 

28 

12 17 

10 

80 

0 148 -0 455 

47 

20 44 

20 

70 

0455 -1.074 

42 

18 26 

20 

50 

1 074 -1 642 

30 

13 04 

10 

30 

1.642 -2 706 

22 

9 56 

10 

20 

2 706 -3 841 

14 

6 09 

5 

10 

3 841 -5 412 

2 

0 87 

3 

5 

5 412 -6 635 

1 

0 44 

1 

2 

6 635 - 

3 

1 30 

1 

1 

Total 

230 

100 00 

100 


* Each class frequency of column 2 is expressed as a percentage of 230. 


“discontinuous” or “discrete” variable which jumps from one integral 
number to the next, leaving out intermediate values (fractions and 
irrationals). You may have observed, for example, that no sample of 
sizes which you have drawn can yield a chi-square in either of the first 
two class intervals, but if samples of 500 or more were counted an occa- 
sional one would have these small values. 

Column 5 contains a cumulative frequency distribution of the percentages 
in column 4. Beginning at the foot of this column 5, each entry is the sum 
of all the preceding ones in column 4, hence the name. The interpreta- 
tion is illustrated thus: the fourth-to-last entry, 10%, means that 10% 
of all samples in the theoretical distribution have chi-squares greater 
than 2.706. This is the measure of chi-square that we have been needing 
to enable us to say precisely which values are to be considered small and 
which large. The discussion will be completed in the next section. 


1.13 — Sample evidence about the hypothesis. Significant chi- 
square. It has been emphasized that the function of a sample is to furnish 
evidence about the population sampled. First we learned that the sample 
ratio is designed as an unbiased estimate of the ratio in the population. 
Next we found that the confidence interval contains the population per- 
centage unless a specified caprice of sampling has occurred. Now we have 
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acquired another piece of evidence associated with some hypothesis 
which the sampler may have about the population ratio — the probability 
of getting a random sample more divergent than the one in hand. 

Columns 1 and 5 of table 1 .4 indicate that 50% of sample chi-squares 
from a given population have values larger than 0.455. We may, there- 
fore, look upon this as an average value, exceeded as often as not in 
sampling. On the contrary, chi-squares greater than 3.841 are rare, 
while those beyond 6.635 occur on the average only once per hundred 
samples. Did you get values as large as these in your sampling? 

It has become customary to refer to any chi-square larger than 3.841 
as significant. In section 1.10 attention was called to the fact that such 
large values in your sampling signify no more than unusually divergent 
samples; but to the investigator, none too sure of his hypothesis, they may 
suggest that perhaps the sample is not really drawn from the population 
which he specified. Naturally, if he knew the real ratio in his population, 
as you do in yours, he would not be doing the experiment. The research 
worker often designs his samplings for the purpose of testing some hy- 
pothesis, accepting or rejecting it on the evidence supplied by the sample. 
A significant value of chi-square denotes a sample so discrepant as to bring 
into doubt the hypothesis set up; in other words, any chi-square beyond 
3.841 is large and suggests rejection of the hypothesis. The larger the 
value of chi-square, the stronger the evidence against the hypothesis. 

Before continuing this discussion we must anticipate a bias in associat- 
ing the theoretical cumulative distribution (table 1.4, column 5) with 
values of chi-square calculated from enumeration data. This bias leads 
to an underestimate of the cumulative percentage (or probability). 
Elimination of the bias is not important in large samples (200 or more) 
but it is so easily done that one might as well get into the habit of it when 
testing hypotheses. Merely calculate an adjusted value of chi-square, 
decreasing the absolute value of X — m by 0.5 before squaring, then 
read the table as usual. For an example, consider a sample of 10 digits 
with 8 of them odd. Since the expected number of odds is 5.06, the 
deviations are 

8 - 5.06 - 2.94 

2 - 4.94 = —2.94 


Subtracting 0.5 from each, ignoring the sign, and substituting the results 
in the formula for chi-square, 


Adjusted x 2 = — + 
J 5.06 


( — 2.44) 2 
4.94 ' 


2.38, 


instead of the unadjusted value, 3.46. The adjustment is even more 
easily applied in the formula of section 1.9. It becomes 

Adjusted x 2 = {\X — m\— 0.5) 2 (l/mi + 1/^), 
where the symbol, | X — m | , indicates the absolute value of the deviation, 
sign ignored. Substituting the data above, 
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Adjusted x 2 = (2.94 - 0.5)70/5.06 + 1/4.94) 

= (2.44) 2 (0.40006) = 2.38, 

as before. 

You will notice that adjusted chi-square is always less than the un- 
adjusted value. This leads to the reading of unbiased probabilities in 
column 5, probabilities that are greater than would be read from un- 
adjusted chi-squares. Note that the hypothesis is less liable to rejection 
when this correction of the tabular probability is made. For theoretical 
discussions of the correction for continuity see references (3), (5), (7), 
(8), (16). 

Look back, now, at the values of chi-square calculated in sections 
6, 7, and 8. The one in the election sample, x 2 = 4 (the adjusted value 
is 3.96) would be considered unusual if the electorate were really divided 
equally. It is likely that the sampler, observing this significant value of 
chi-square, would reject the hypothesis of a tie vote and would say that 
the election will go to the party which received the majority vote in his 
sample of 10,000. In saying this he will risk being wrong, because there 
are almost five chances in a hundred of getting a sample with chi-squarc 
greater than 3.96 in sampling from an equally divided population. Such 
is the kind of risk always run by those who base decisions on sampling. 

In the experiment with tomatoes, chi-square was 1.33, the adjusted 
value being 1.20. Since larger values than this would occur in some 27% 
of samples drawn from the hypothetical population, the geneticist would 
probably hold to the 3:1 hypothesis, saying that his chi-square is non- 
significant. In doing this he may be in error. Actually, the sample ratio 
was 310:90, and if there were no genetic theory to guide him, he would 
accept this as his estimate of the population value. Also, he knows well 
that there are various perturbations of the simple inheritance patterns, 
and he is on the lookout for them. Nevertheless, in the absence of other 
information, and in view of the design of the experiment, which was to 
test the 3:1 hypothesis, he would usually interpret the small chi-square 
as indicating only sampling variation from this genetic ratio. 

Some people adopt a rather slavish attitude toward tests of significance, 
rejecting the hypothesis if chi-square is more than 3.841 and accepting it 
if chi-square is less. This indicates inadequate appreciation of the nature 
of the information acquired by sampling. A sample furnishes evidence, 
not proof. This evidence is to be added to that already accumulated from 
experience and reports of other research. LTsually, also, there is collateral 
information accruing during the progress of the experiment. It is the 
investigator’s responsibility to integrate all this evidence and to reach a 
decision. He cannot evade this responsibility merely by citing a value of 
chi-square. 

EXAMPLE 1.20— -Calculate the adjusted value of chi-square for the sample of 
800 pea seeds of example 1.15. Ans. 7.80. The geneticist would most likely reject 
the 1:1 hypothesis for the population from which this sample was drawn. 
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EXAMPLE 1.21 — Calculate adjusted chi-square for the sample of 1,000 voters in 
example 1.18. Ans. 10.21. The sample constitutes convincing evidence that the candi- 
date’s hypothesis was wrong. 

EXAMPLE 1.22— -In some experiments on heredity in the tomato, MacArthur (10) 
counted 3,629 fruits with red flesh and 1,176 with yellow. This was in the Ft generation 
where the expected ratio was 3:1. Compute x 2 ~ 0.68. MacArthur concluded that 
“The discrepancies between the observed and expected ratios are not significant.” 

EXAMPLE 1.23 — In the same scries of experiments, MacArthur counted 671 
plants with green foliage and 569 with yellow. This was a backcross in which the 
theoretical ratio was 1 1. Chi-square = 8.23 is strong evidence against the 1:1 hy- 
pothesis, “resulting without doubt from a lower viability in the recessive class.” 

EXAMPLE 1.24 — In the South Dakota farm labor sample (section 1.2) 480 of 
the 1,000 farmers were classed as owners (or part owners), the remaining 520 being 
renters. It is known that, of nearly 7,000 farms in the region, 47% are owners. Assum- 
ing this to be the population percentage, calculate adjusted chi-square for the sample 
of 1,000. Ans. x 2 83 0.36. Does this increase your confidence in the randomness of the 
sampling? Such collateral evidence is often cited. The assumption is that if the sample 
is shown to be representative for one attribute it is probably representative also of the 
attribute under investigation, provided the two are related. 

EXAMPLE 1.25 — James Snedecor (13) tried the effect of injecting poultry eggs 
with female sex hormones. In one series 2 normal males were hatched together with 
19 chicks which were classified as either normal females or individuals with pro- 
nounced female characteristics. What is the probability of the ratio 2 T9, or one more 
extreme, in sampling from a population with equal numbers of the sexes? Ans x 2 
= 12.2, P » 0.0004. For a method of evaluating probabilities not in the table, see 
section 8.8. 

EXAMPLE 1 26 — In table 1.4, 42 + 30 + 22 + 14 + 2 + 1 + 3 « 114 sam- 
ples have chi-squares greater than 0.455, whereas 50% of 230 were expected. W r hat 
is the probability of drawing a more discrepant sample if the sampling is truly random? 
Ans. x 2 ~ 0.00435, P = 0.95. What is the probability of greater deviation from half- 
and-half than the deviation in your samples drawn in accord with section 1.5? 

The reader who is primarily interested in measurement statistics may wish now to turn forward 
to chapter 2, omitting the remainder of this one. The necessary preparation has been made in the 
foregoing sections. He could well return later to the rest of chapter 7 for more thorough study. 

1.14— Chi-square and sample size. From the formula, 
x 2 = S(X - my/m, 

it is clear that if the deviation, A" — m , is unchanged chi-square decreases 
as the size of sample increases, because, for any particular hypothesis, 
m varies directly as sample size. This is illustrated by the first example 
in table 1.5. Multiplying sample size, n i by four reduces chi-square to 
one-fourth its former value. The meaning is that, while a deviation of 10 
is unusual in a sample of 100, it is rather common in the larger sample — 
the larger the sample, the more latitude there is for sample variation. 

In the second example the deviation is increased fourfold along with 
sample size, the effect being to multiply the original chi-square by four. 
The deviation, 40, in the larger sample is much less likely than is 10 in 
the original. This shows that the latitude for sample variation does not 
increase directly with sample size. 

The deviation in example three is only doubled as the original sample 
size is quadrupled, chi-square remaining unchanged; that is, the deviation, 
20, in a sample of 400 has the same expectation as the deviation, 10, in 
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the sample one-fourth as large. Thus, for equal likelihood of occurrence 
the deviation must vary directly as the square root of n. 

Clear understanding of the illustrated principles is necessary if one is 
to appreciate the meaning of chi-square. Here is one application: results 
are often reported not in numbers originally counted but as percentages 
of individuals having the attribute; that is, as so many per hundred enumer- 
ated. It is clear that such percentages cannot be used directly in the 
calculation of chi-square except in the case where the sample size is an 
even hundred. In all other samples, before chi-square is computed the 
percentage must be converted to the actual number of individuals found 

TABLE 1.5 

Three Examples Illustrating Changes in Unadjusted Chi-Square Resulting 
From Quadrupling the Original Sample Size 



Original 

Random 

Sample 

Example 1 
Deviation 
Unchanged 

Example 2 
Deviation 
Increased 
Directly as n 

Example 3 
Deviation 
Increased 
as \ n 

Sample size, n 

100 

400 

400 

400 

Number having 





attribute 

60 

210 

240 

220 

Expected on basis 





of 1 : 1 hypothesis 

50 

200 

200 

200 

Deviation 

10 

10 

40 

20 

Chi-square 





(unadjusted) 

4 

1 

16 

4 


with the attribute, or else some appropriate device must be employed for 
accomplishing the same end. 

Another application of the principles of this section is made in planning 
the size of a sampling. Consider a circumstance like this: a geneticist 
studying heredity in the tomato has examined 100 seedlings and found 46 
with yellow foliage. Theoretically the plants are expected to segregate 
half green and half yellow, but he suspects lower viability in the seeds 
having the recessive yellow gene. The deviation of yellow seedlings from 
the expected 50 is 4% of the number examined, but chi-square (un- 
adjusted) is only 0.64, so the statistical evidence against the 1:1 ratio is 
slight. Nevertheless, the suspicion of lower viability must be tested, and 
the question is asked, “In the event that the 4% deviation is found in a 
larger sample, how many plants should be examined to make chi-square 
10 times its present sample value?” The appropriate example in table 1.5 
is the second because, the deviation being assumed to be 4% of the plants 
counted, its size is to increase directly as n. If so, chi-square will increase 
in the same way, and a sample of 10 times 100 should have the required 
unadjusted value, 6.4. MacArthur’s data in example 1.23 might have 
been the outcome of some such reasoning, verifying the suspected character 
of the gene. On the other hand, the suspicion might have proved un- 
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founded, the larger sample turning up with no more than ordinary devia- 
tion from the 1:1 ratio. In either event the investigator has acquired the 
desired information. 


1.15 — Another formula for chi-square. This formula gives the same 
result as the more general one, but often is easier of computation: 


,j _ (g - [W 

r{a + b) ’ 


where a and b are the numbers in the two classes and r is the hypothetical 
ratio of the corresponding numbers in the population. Applying this to 
the tomato problem of section 1.6: 


a = number of red fleshed fruits = 310 
b = number of yellow fleshed fruits == 90 
a + b = 400 
r = 3/1=3 




3 X 90) 2 


3(400) 


= 1.33, as before. 


The adjustment of chi-square is introduced rather simply: 

2 = (If ~ rb \ ~ (r + i)/2) 2 

r(a + b) 


_ (310 - 270 - (3 + l)/2) 2 
3(400) 

= 1.20, as in section 1.13 

Returning to the question of sample size, the formula of this section 
furnishes a convenient answer if the investigator, wishing to test the r:l 
hypothesis, is willing to rest his decision on a sample whose deviation is 
any amount up to p% of the sample size. This size is to be chosen so that 
if the deviation turns out to be p% then chi-square shall be significant. 
His thinking would be that any less deviation is inconsequential, but 
that if the p% deviation proves significant the fact will be worth knowing. 
With a little manipulation, the formula becomes 


[(1 + r)p/my 


where chi-square may be chosen at any desired level. As an example, 
suppose the 3:1 hypothesis is to be tested, and a deviation of 5% of n 
will be interesting if significant. Then r = 3, x 2 = 3.841 and p — 5%. 
Substituting, 


n = 


3(3.841) 
(4 X 0.05) 2 


= 288 
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If 288 plants are examined, and if the deviation from the expected 3:1 
segregation is 5% of 288 = 15 plants or more, then chi-square will be 
adjudged significant. 

EXAMPLE 1.27— Jenkins and Bell (9) reported a gene producing yellow seedlings 
in maize, a simple recessive to the normal green. Among 9,717 counted the ratio of 
green to yellow seedlings was 78.95% to 21.05%, whereas the expected ratio was 3:1. 
Does the sample evidence support the hypothetical 3:1 ratio? Ans. Adjusted x tvB 80.83. 
The investigators showed by means of further experiments that the gene was “linked 
with lethal or semi-lethal factors which caused a deficiency in the yellow seedling class. 
Satisfactory 3:1 ratios were not obtained until these cultures had been outcrossed and 
the disturbing factors eliminated.” 

1.16 — Ratios, rates, and percentages. These fractions are very popu- 
lar in the presentation of statistics because, being usually reduced to some 
common denominator, their numerators are readily compared. Examples 
are given in table 1.6. The percentages of males all have the denominator, 
100 guinea pigs. The two sets of ratios are numerators, respectively, 
of one and one hundred females. Other denominators are sometimes used. 
Death rates, for example, are usually expressed as so many per thousand 
people, or per 100,000 for some rare diseases. 

Ease of comparison in a set of these fractions is gained at the expense 

TABLE 1.6 

Average Numbers of Males and Females Littered in a Colony of Guinea Pigs, 
by Month. From Schott and Lambert (12) 

Various Sex Ratios 



Jan. 

Feb. 

Mar. 

Apr 

May 

June 


Males 

65 

64 

65 

41 


72 

80 


Females 

49 

58 

81 

48 


62 

80 


Total 

114 

122 

146 

89 


134 

160 


Percentage males 
Ratio, males to 

57.0 

52 5 

44 5 

46 

1 

53.7 

50 0 


females 

Ratio, males per 

1.327 

1 103 

0 802 

0 

854 

1.161 

1 000 


100 females 

132.7 

110 3 

80 2 

85 

4 

116 1 

100 0 












July 

Aug. 

Sept. 

Oct. 

Nov. 

Dec. 

Total 

Males 

88 

114 

80 

129 


112 

86 

996 

Females 

95 

118 

94 

104 


144 

85 

1,018 

Total 

183 

232 

174 

233 


256 

171 

2,014 

Percentage males 
Ratio, males to 

! 48.1 

49.1 

46.0 

55 

4 

43.8 

50.3 

49.45 

females 

Ratio, males per 

0.926 

0.966 

0.851 

1. 

.240 

0.778 

1.012 

0.9784 

100 females 

92.6 

96.6 

85.1 

124 

0 

77.8 

101.2 

97.84 
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of information. The original denominators are set aside, even sometimes 
lost sight of, a fault to be deplored. Many computations require return 
to the original data, as for example the fractions in the column of totals 
where' 

49.45% = (100)(996)/2,014, etc. 

Attention has already been called to the fact (section 1.14) that percent- 
ages cannot be used in the calculation of chi-square unless some method 
is provided for recovery of the original base, thus giving due weight to 
sample size. 

In many instances the bases of comparable percentages are not very 
different so that the percentages themselves can be averaged with no 
great inaccuracy. Even the guinea pig percentages of males have an 
average, 

57.0 + 52.5 + . . . + 5 0.3 = 596^5 ^ ~ 

12 12 ‘ /o 

which is not greatly divergent from the correct 49.45%. However, if one 
wishes an accurate average of percentages that have different bases he 
must go back to the original fractions, add their numerators and denomi- 
nators separately, then divide. 

EXAMPLE 1.28 — Three dairy herds of a certain community showed the following 
reaction to a test for tuberculosis: 


Number cows in herd 

40 

100 

10 

Percentage reactors 

5 

2 

60 


Calculate the average, 6 7%. Do you think this is a better average than (5 -j- 2 -f* 60)/ 
3 - 22.3% ? 

EXAMPLE 1.29 — The percentages of noxious weed seeds in two samples of timothy 
are 0.01% and 0.05%. If each sample consisted of 10,000 seeds, what is the average 
percentage in the two? Ans. 0.03%. 

EXAMPLE 1.30 — If the samples in the foregoing example were 80,000 and 20,000 
seeds, respectively, what would be the average? Ans. 0.018%, quite properly nearer 
the percentage of the larger sample, 

EXAMPLE 1.31 — Schott and Lambert reported that the numbers in table 1.6 
are averages over 7 years so that the total number of males was 6,972 and of females, 
7,126, the sex ratio being 97.84 males per 100 females. Test the hypothesis that the 
population sex ratio is 100. Ans. x 2 — 1.66. Note: If the averages were used, chi- 
square would be 0.219, only about one-seventh the correct value. Why isn’t it exactly 
one-seventh? 

1.17 — Suggestions and cautions. Summary. In reporting your 
sampling or experiment, bear in mind these principles, (i) The estimates, 
confidence statements and tests of hypotheses presumably summarize 
all the information in your data; consequently there is seldom need for 
publication of the numerical details, (ii) Space in journals is precious 
and should not be used for describing computational procedures or 
explaining the meaning of statistical terms. Those are functions of the 
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textbook. Your job is to report the facts of the sampling, to discuss the 
relation of your findings to the problem set up, to make decisions and 
draw conclusions, (iii) A well-constructed table often displays all the 
desired information about the statistics of an investigation. It leaves both 
the writer and reader free to consider the biological implications, un- 
hampered by numerical detail or such distracting comment as “the differ- 
ence is statistically significant.” (iv) The sampled population is specified 
by a description of the circumstances surrounding the experiment (com- 
pare section 1.2). Details of the species and varieties used, the breeding 
and rearing of animals, soil, fertilizer and season for plants, laboratory 
techniques — such are required in order that other data from the same 
population may be compared to yours with no more than allowable 
sampling differences between them. 

There is widespread misinformation concerning sample based infer- 
ences about populations. It may be well to point out two misstatements 
often met. First, about the 0.95 confidence interval, or more particularly 
about the similar concepts of standard and probable error, one often 
hears it said that “95% of other samples fiom the same population will 
have ratios lying within this interval.” If you look carefully through sec- 
tions 1.3 and 1.4, especially through your sampling of section 1.5, you 
will find no basis for such an inference. Only if the population ratio is 
known, as it was in the calculations of section 1.9, can the distribution 
of sample ratios be predicted by the methods which have been presented 
in this chapter. The fallacy of the statement is evident from your sampling 
of digits. For illustration, select one of the samples with a percentage 
rather divergent from 50.6%, then calculate the proportion of your 
other sample percentages falling within the confidence interval set. 
Perhaps less than half of them do. You may find some samples with 
percentages close to 50% for which the statement is true, but these are 
exceptions. 

Another common but erroneous inference follows from the chi-square 
test. If chi-square is small it is often said that “the odds are 19:1 that the 
hypothesis is correct.” I have presented no method for evaluating the 
probability of hypotheses. From the viewpoint adopted in studying chi- 
square the hypothesis is fixed, the probability evaluated being that of 
drawing a sample from it more extreme than the one in hand. It is true 
that after the evidence is all accumulated the investigator must make a 
decision about the hypothesis, and his decision has some presumably 
high probability of being correct, but we do not know how to evaluate 
such a probability. 

Summarizing this chapter, we have studied the sample as a source of 
information about a population. If randomly drawn it provides an un- 
biased estimate, a confidence interval and a test of the sampler’s hypothe- 
sis. By drawing a lot of samples we have verified some useful mathe- 
matical theorems about them, notably, the confidence interval and the 
distribution of chi-square. The chi-square coefficient of dispersion has 
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proved itself a powerful tool for extracting information about the popula- 
tion. 

In chapters 9 and 16 we shall return to the consideration of enumera- 
tion data. In those intervening, methods appropriate to measurements will 
occupy attention. 
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Chapter 2 

MEASUREMENT DATA. COMPARISON OF INDIVIDUALS 


2.1 — Introduction. In the first chapter only one kind of data was 
considered, that resulting from the enumeration of individuals possessed 
of some interesting attribute. Alive or dead, infested or free, red or white — 
such were the categories observed. Another type of data, even more 
common, derives from measuring some continuous variate like height 
or yield or income. How much do you weigh? How old are you? Numeri- 
cal facts like these are intimately related to all our thinking. In this and 
following chapters will be developed statistical methods appropriate to 
such measurement data. The principle involved is the same as that 
already presented: the collection and examination of a sample for the 
purpose of learning about the population. Unbiased estimates, confidence 
statements and tests of hypotheses will be the familiar concepts employed. 

2.2 — The mean as an unbiased estimate. Let us turn immediately 
to some reported data and look later at the pertinent theory. In 1936 the 
Council on Foods of the American Medical Association sampled the 
vitamin C content of commercially canned tomato juice by analyzing a 
specimen from each of the brands that stood accepted by the Council (4) . 
The data are shown in the second column of table 2.1. A commonly 
used average, the arithmetic mean or simply the mean , is calculated by 
dividing the sum of the 17 items by their number, 

333/17 = 19.6 milligrams of ascorbic acid per 100 grams of juice 
This average summarizes the results in a single value which, in some 
respects, is typical of the whole set. In size it is intermediate among the 
individual data. In successive samples from the same population the 
mean varies less than the observations, one of the reasons that give us 
confidence in using it. Another of these reasons is that, with the assump- 
tions to be described below, this sample average is an unbiased estimate 
of the population mean. We have thus reached again the familiar first 
inference of section 1.3 — the mean vitamin C content of the tomato juice 
manufactured in 1936 under the 17 specified brands is estimated without 
bias by the sample mean, 19.6 mg./lOO g. To make the second inference 
(section 2.12) some further preparation is necessary. 

2.3 — The range, a measure of variation. In looking through the 
vitamin C contents of the seventeen brands, one is struck by their vari- 
ability. From high to low, their range is 

29 — 13 — 16 mg. per 100 grams 
[ 31 ] 
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TABLE 2.1 

Vitamin C Content of 17 Specimens of Commercially Canned 
Tomato Juice (4), 1936 


Observation 

Vitamin C Content 

Deviation 

Deviation 

Number 

Mg. per 100 g. 

From Mean 

! 

Squared 

n 

Original 

X 

x * X 

- £ 

x 2 

1 

16 

16 

- 4 


16 

2 

22 

22 


+ 2 

4 

3 j 

21 

21 


+ 1 

1 

4 1 

20 

20 

0 


0 

5 

23 

23 


-f 3 

9 

6 

22 

21 


+ 1 

1 

7 

17 

19 

- 1 


1 

8 

15 

15 

- 5 


25 

9 

13 

13 

- 7 


49 

10 

22 

23 


+ 3 

9 

11 

17 

17 

- 3 


9 

12 

18 

20 

0 


0 

13 

29 

29 


+ 9 

81 

14 

17 

18 

- 2 


4 

15 

22 

22 


+ 2 

4 

16 

16 

16 

- 4 


16 

17 

23 

25 


+ 5 

25 

t 

Totals 

333 

340 

-26 

+26 

254 


£ = 340/17 = 20 mg. per 100 grams 
j 2 « 254/16 = 15.88 s = 3.98 mg /100 g. 

i-i 2 « 15.88/17 - 0.934 r* = 0.97 mg./lOO g. 


What are the causes of this variation? Different processes of manufacture, 
perhaps, and different sources of the fruit. Doubtless, also, the specimens 
examined, being themselves samples of their brands, differed from the 
brand means. Finally, the laboratory technique of evaluation is never 
perfectly accurate. Variation is the very essence of statistical data, and 
the range of a sample is a good rough-and-ready way to measure it. A 
more reliable estimate will be given in section 2 8 below. 

The mean and the range constitute a summary of the information 
contained in the entire set. The average vitamin G content is 19.6 mg 
per 100 grams with a range of 16 mg. per 100 grams. These two summary 
numbers convey the outstanding facts of the sample. 

2.4 — The array. To make succeeding computations simpler let us 
temporarily transfer attention to the slightly modified series in the third 
column of the table. Some of the more intimate features of this sample 
may be revealed by arranging it from low to high in an array like this: 

13, 15, 16, 16, 17, 18, 19, 20, 20, 21, 21, 22, 22, 23, 23, 25, 29 
The range is now obvious. Attention is also attracted to the concentration 
of the values near the center of the array. It is this central tendency of sets 
of data that gives rise to that faith we feel in samples and in estimates 
based on them. 
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2.5 — Graphical representation. Instead of presenting data in a 
table, it is sometimes more effective to display the facts graphically. 
Figure 2.1 is a graphical representation of the foregoing array of 17 
vitamin determinations. A dot represents each item. The distance of the 
dot from the vertical line at the left, proportional to the concentration 
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Fig. 2.1 — Graphical representation of an array Modified Vitamin G data. 


of ascorbic acid in a brand specimen, is read in milligrams per 100 grams 
on the horizontal scale. 

The diagram brings out vividly not only the variation and the con- 
centration in the sample, but also two other characteristics: (i) the rather 
symmetrical occurrence of the values above and below the mean, and 
(ii) the scarcity of both extremely small and extremely large vitamin G 
contents, the bulk of the items being near the middle of the set. These 
features recur with notable persistence in measurement data as well as 
in the samples of enumeration data examined in chapter 1. For many 
variables associated with living organisms there are averages and ranges 
peculiar to each, reflecting the manner in which each seems to express 
itself most successfully. These norms persist despite the fact that indi- 
viduals enjoy a considerable freedom in development. A large part of our 
thinking is built up around ideas corresponding to such statistics. The 
words, pig, daisy, man, each raises an image which is quantitatively 
described by summary numbers. It is difficult to conceive of any progress 
in thought until memories of individuals are collected into concepts like 
averages and ranges. 
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2.6 — Symbolical representation. The items in any set may be 
represented by 

X u Xu Xu . . . X ni 

where the subscripts 1, 2, ... a, may specify position in the set of n items. 
(The three dots accompanying these symbols are read, and so on.) Match- 
ing the symbols with the values in column 3 of table 2.1, 

Xy = 16, A r 2 = 22, . . . An = 23 mg./lOO g. 

The mean is represented by x , so that 

x = (X\ + X 2 + . . . + X n )/n 
This is condensed into the form 

£ = (SX)/n, 

where X stands for every item successively. The symbol, SX . , is read, 
“summation A? 5 or “sum of the ATs” while x is called “*-bar,” or “bar-*.” 
Applying this formula to the values of X in table 2.1, 

SX = 340, and x = 340/17 = 20 mg./lOO g. 

The population mean may be symbolized by m in order to distinguish 
it from the sample estimate, x (compare the use of m in section 1.8). 

2.7 — Deviations from mean. The individual variations of the items 
in a set of data may be well expressed by the deviations of these items from 
some centrally located number such as the mean. For example, the devia- 
tion-from-mean of the first A'-value in table 2.1 is 16-20 = — 4 mg. per 
100 g.; that is, this specimen falls short of the mean by 4 mg./lOO g Of 
special interest is the whole set of deviations calculated from the array in 
section 2.4: 

-7, —5, —4, —4, —3, —2, -1, 0, 0, 1, 1, 2, 2, 3, 3, 5, 9 

These deviations are represented graphically in figure 2.1 by the distances 
of the dots from the vertical line drawn through the mean. 

Deviations are almost as fundamental in our thinking as are averages 
themselves. “What a whale of a pig” is a metaphor expressing astonish- 
ment at the deviation of an individual’s size from the speaker’s concept 
of the normal. Gossip and news are concerned chiefly with deviations 
from accepted standards of behavior. Curiously, interest is wont to center 
in departures from norm, rather than in that background of averages 
against which the departures achieve prominence. Statistically, freaks 
are freaks only because of their large deviations. 

Deviations are represented symbolically by lower case letters. That is: 

*i = X\ — x 
xt = Xi — x 


X n = X n ~ X 
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Just as X may represent any of the items in a set, or all of them in succes- 
sion, so x represents deviations from mean. In general, 

* = X - x 

It is easy to verify the theorem that the sum of a set of deviations- 
from-mean is zero; that is Sx = 0. The set listed in table 2.1 adds to 
zero, the sum of the positive deviations being equal to the sum of the 
negatives. This theorem about deviations-from-mean is useful for verify- 
ing the calculation of a set of deviations — be sure that the sum is zero. 
As a consequence of the theorem, it follows that the mean of the deviations 
is zero. This is a theorem about which you will be reminded later. 

EXAMPLE 2.1 — The weights of 12 staminate hemp plants in early April at College 
Station, Texas, (17), were approximately: 

13, 11, 16, 5, 3, 18, 9, 9, 8, 6, 27, and 7 grams 
Array the weights and represent them graphically. Calculate the mean, 11 grams, 
and the deviations therefrom. Verify the fact that Sx = 0. 

EXAMPLE 2.2— The heights of 11 men are 64, 70, 65, 69, 68, 67, 68, 67, 66, 72, 
and 61 inches. Compute the mean and verify it by summing the deviations. Are the 
numbers of positive and negative deviations equal, or only their sums? 

EXAMPLE 2.3 — The yields of alfalfa from 10 plots were 0.8, 1.3, 1.5, 1.7, 1.8, 
2.0, 2.0, 2.0, 2 0, and 2.2 tons per acre. How many deviations are positive and how 
many negative? Is their sum zero? 

EXAMPLE 2.4 — The weights of 1 1 forty-year-old men were 148, 154, 158, 160, 
161, 162, 166, 170, 182, 195, and 236 pounds. Contrast the graphical representation 
of this array with that of the preceding example. Notice the fact that only three of the 
weights exceed the mean. Why? Is the mean typical of this sample of weights? Is Sx 
- 0 ? 

EXAMPLE 2.5 — The following were the yields of two varieties of oats in five 
successive years (bushels per acre)* 



Year 

Variety 

1 

2 

3 

4 

5 

A 

34 

30 

41 

25 

45 

B 

30 

15 

33 

25 

25 


Compute the mean difference, 9.4 bushels per acre. Show that the sum of the devia- 
tions of the differences from the mean difference is zero. 

EXAMPLE 2.6 — The following data are adapted from Reddy’s (14) investigations 
of the differences in yield attributable to the disinfection of Diplodia infected seeds of 
maize. The figures represent bushels per acre. 






Pairs of Plots in 1933 



Treatment 

1 


2 

3 

4 

5 

6 

Treated 

68 

1 

74.6 

64 4 

69 2 

61 8 

57.9 

Untreated 

64 

7 

62 5 

66 8 

69.2 

53.9 

58.5 

Differences 

3 

4 

12 1 

— 2 4 

0 0 

7.9 

-0.6 
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Paiis of Plots in 1934 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

18 0 
10 9 

24 0 
24 4 

18 8 
15 1 

17 8 
16 8 

18 5 

13 2 

27 2 

21 6 

23 6 
13 7 

23 9 
17 5 

20 3 
16 3 

11.9 

15.5 

7.1 

-0 4 

3.7 

1.0 

5.3 

5.6 

9 9 

6 4 

4.0 

— 3.6 


Verify the fact that the mean difference is (approximately) 3.7 bushels per acre. 
Represent graphically the array of differences. It is worthy of note that the differences 
are rather uniformly distributed throughout the two years despite the great disparity 
in the seasons’ yields. 

EXAMPLE 2.7 — If you sum the deviations from 3.7 bushels per acre in the fore- 
going example you will not get zero. Why? If you compute the mean of the devia- 
tions and add it to 3.7, will you get the exact mean of the differences? Are you keeping 
clearly in mind the distinction between deviations and other differences? 

EXAMPLE 2.8 — Suppose you wish to estimate the yield of a field of 300 rows of 
corn. You actually harvest 10 rows and determine the mean yield, 5 bushels per row. 
Would you hesitate to fix the field yield at 1,500 bushels? You would be assuming 
that the field mean is the same as that of the 10 harvested rows, and would be using 
the theorem, SX » nx. 

EXAMPLE 2.9 — If you have some skill in algebra, prove the theorem that Sx — 0. 
Starting with the relation, x = X — x , sum both members, then substitute SX = n£. 

EXAMPLE 2.10 — If you have two sets of data which are paired as in example 2.5, 
and if you have calculated the resulting set of differences, prove that the mean of the 
differences is equal to the difference between the means of the two sets. Verify this 
theorem by use of the data in example 2.5. 

2.8 — The standard deviation. We have mentioned the range as a 
measure of variation; a good one, too, if no very discriminating work is to 
be done. A moment’s thought will convince you, however, that it is not 
reliable. Just look back at figure 2.1 and call to mind the infrequency 
of occurrence of extreme measurements There are not many of them in 
the whole population, making it improbable that they will be well repre- 
sented in a small sample. Many samples, in fact, will include none oi the 
population oddities, being made up entirely of those more numerous indi- 
viduals nearer the mean. You see, the range, depending wholly upon the 
difference between the two sample extremes, is likely to be erratic in 
sampling. For exacting investigations it is desirable to have a more stable 
average of variation, one resembling the mean in characteristics and 
method of computation. 

With this ideal in mind, one naturally turns to the set of deviations in 
section 2.7. Surely, an average of variation should be founded on these 
measures of individual variation. Since their mean, however, is always 
zero, that simple average is worthless. A natural suggestion is to ignore 
the signs, computing the mean of the absolute values of the deviations. This 
average, the mean absolute deviation , had a considerable vogue in times past, 
but it was soon found to lead into a blind alley so far as statistical theory 
is concerned. 
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After much experience, statisticians selected an average of variation 
called standard deviation , symbolized by j. Its calculation is set out in the 
right hand part of table 2.1. First, each deviation is squared. Next, the 
sum of squares , 5* 2 , is divided by the number of degrees of freedom , (n — 1), 
yielding the mean square or variance , s 2 . Finally, the extraction of the 
square root recovers the original unit of measurement, milligrams per 
100 grams of juice, so the standard deviation is a typical deviation in much 
the same way that the mean is a typical observation. 

2.9 — Why square the deviations? In a non-mathematical discussion, 
it is quite impossible to give an adequate answer to this question. The 
most convincing justification of the process must lie in the value of the 
results produced, and these will become increasingly evident as your 
knowledge of statistics expands. Nevertheless, some excuses may be in 
order. Attention was called to the facts that neither the deviations nor 
their absolute values furnished satisfactory averages. Squaring the devia- 
tions is the next operation in respect of simplicity. Incidentally, the squares 
are all positive, increasing the ease of computation. Some people think 
that squaring makes harder calculations than absolute values would. 
While this is true in the simple form of table 2.1, it is exactly contrary 
to the facts in the more extensive calculations that will be required later. 
The squares lend themselves amazingly to short cuts in computation 
and powerful methods in statistics. 

More fundamentally, squares of deviations are basic in the theory of 
the best known and most widely used frequency distribution, the normal. 
You will hear more of this distribution in later chapters. It is symmetrical, 
contrasting sharply with that of chi-square (section 1.11). The values in 
table 2.1 and the differences in example 2.6 may well be samples drawn 
from normally distributed populations. It is the widespread incidence 
of this and related populations in biology that accounts for the well-nigh 
universal use of squared deviations for making the average of variation. 

People with a vivid sense of form find help in understanding the 
nature of standard deviation by considering the geometric interpretation 
of the computations of table 2.1. Draw to scale 17 straight lines, one 
representing each of the deviations. The zero deviations may be sym- 
bolized by points. Construct a square on each of the lines as side. The 
areas of these squares correspond to the squares of the deviations in the 
last column of the table. The sum of all the areas is 254 square units. 
The average area, the mean square, is 15.88 square units. The standard 
deviation, the square root of 15.88, is the side of a square having the aver- 
age area, and is measured in the same unit as are the 17 deviations and 
the 17 observed values. 

2.10 — Why divide by (n — 1)? This is another question whose 
answer lies in the mathematical theory, but which has implications 
important to the biologist. Two of these will be mentioned here. The 
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first has to do with the sample itself, consisting of n independent observa- 
tions. From these the mean, x (a statistic ), is computed and set up as an 
estimate of the mean, m (a parameter ), of the population. Then deviations 
are calculated. These, however, are not all independent. Since their 
sum must be zero, after (n — 1) of them are selected at random the nth is 
fixed. Suppose, for example, four out of five deviations are taken quite 
arbitrarily as 3, — 1, 2, and 5. The fifth, then must have the value, 

0 — (3 — 1+2 + 5) = —9 

This is an explanation of the inference made above that there are only 
(n — 1) degrees of freedom associated with the sum of squares, Sx 2 , the nth 
degree of freedom being assigned to the average from which deviations 
are measured. The mean square, or variance, is therefore defined as 
Sx*/(n — 1). 

The second implication is equally important. If we were concerned 
only with a description of sample variance, we might still insist on dividing 
the sum of squares by n. But our prime interest is in the population 
variance. How can we obtain an unbiased estimate of that parameter? 
The answer is to divide the sum of squares by the number of degrees of 
freedom, {n — 1), instead of by the sample size, n. One explanation is 
easy to find. It lies in the improbability of drawing representatives of the 
population extremes in small samples. The range of the sample is likely 
to be much less than that of the population. The sample mean square 
deviation (variance) would likely fall short of the parameter if division 
were by n. “Student” (16) showed in 1908 that this tendency is compen- 
sated, on the average, if the denominator (n — 1) is used. While the 
distinction is numerically unimportant in large samples, it is vital in 
much experimentation. 

EXAMPLE 2.11 — Verify the facts that the mean of the following set is 10 grams 
and the standard deviation, 4 grams: 

3, 6, 6, 12, 9, 12, 10, 9, 12, 14, 17 

EXAMPLE 2.12 — Compute the standard deviation, 6.7 grains, of the weights of 
hemp plants in example 2.1. Note that somewhat more than two-thirds of the weights 
fall within the interval 11.0 at 6.7 grams. 

2.11 — Some characteristics of the standard deviation. The squaring 
of the deviations has two slightly inconvenient results: the distinction 
between positive and negative deviations is lost, and the few large devia- 
tions contribute a disproportionate amount to the sum of squares. It is 
imperative to understand clearly the consequent characteristics of the 
standard deviation. Like any other average it is, in a specific way, 
typical of the deviations it represents, and is intermediate among them in 
size. Let us look at these deviations (section 2.7), shorn of their signs and 
rearrayed: 

0, 0, 1, 1, 1, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 7, 9 mg./lOO g. 
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There is no symmetry about this set. It might well have been drawn from 
a population distributed like chi-square. Certainly, the mean of these 
deviations (that is, the mean absolute deviation) will not be centrally 
located in the set. Both these results follow the loss of the signs. Further- 
more, the two largest absolute deviations contribute more than half of the 
total sum of squares (table 2.1). The consequence is that 11 of the 17 
absolute deviations are smaller than s (3.98 mg./lOO g.) and only 6 larger. 

Much the same account can be given without the bother of absolute 
deviations. Look back at the array of X in section 2.4. Consider the 
interval from 

x - s = 20 - 3.98 = 16.02 mg./lOO g. 
to 

x + s = 20 + 3.98 - 23.98 mg./lOO g. 

This interval includes the 1 1 values, 

17, 18, 19, 20, 20, 21, 21, 22, 22, 23, 23, mg./ 100 g. 

These 11 items constitute 65% of the total of 17. Such a majority of 
observed values within the interval, x =*= s, is common in sampling. It 
reflects the fact that in normally distributed populations with mean, m, 
and standard deviation, a (the Greek lower case sigma is used to dis- 
tinguish the population parameter from the sample estimate, s), the inter- 
val, ?72 =±= a, contains 68.27% of the items; that is, about two-thirds. Of 
course, it may be expected that in random samples, even those drawn 
from normal populations, this fraction may vary considerably. Neverthe- 
less, one should always take the precaution to verify his computation of s 
by observing the proportion of items falling between x — s and x + s. 
If it is radically different from two-thirds, mistakes in arithmetic are 
suggested. 

If you study mathematical statistics, you may hear a good deal about 
the Principle of Least Squares. The mean and deviations therefrom are 
related to that principle in this manner: if deviations are measured from 
the mean, then the sum of their squares is a minimum. In particular 
(and reversing the statement) if deviations in table 2.1 are measured from 
some number different from the mean, 20, the sum of their squares will 
be greater than 254. Verify this by trying deviations from, say, 19 then 22. 

This least squares theorem about the mean throws further light on 
division by n — 1. Since the mean of a sample is usually different from 
that of the population, the sum of squares of deviations from the sample 
mean is commonly less than the corresponding sum of squares of deviations 
from the population mean. In sampling from a normal distribution, 
division by n — 1 counterbalances this tendency, giving an unbiased 
estimate of the population variance. The standard deviation, is not an 
unbiased estimate of the population standard deviation , a*, but tends to be 
smaller. This is one reason for using the variance so much, another being 
its greater ease of computation. 
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As would be expected, the range and standard deviation are not un- 
related. If you draw a lot of samples from a normally distributed popula- 
tion, each sample with 9 or 10 items, you will find that the average of 
their ranges is about three times the population standard deviation. 
Hence, one-third the range of each such sample is an estimate of a in the 
population. Similarly, if n is between 25 and 30, one-fourth the range is 
the appropriate estimate of a (see table 5.5). By fixing in mind a few such 
values (section 5.8), one has a basis for estimating the standard deviations 
of familiar populations. Even more, he can make fairly good guesses at 
the standard deviations, s , of the samples observed. As an example, sup- 
pose a group of 27 young men of age 22 are randomly chosen insofar as 
stature is concerned. Measure the heights of the shortest and tallest of 
the group and subtract to get the range, say 

73 — 62 = 11 inches. 

Not only is one-fourth of this range, 2.8 inches, an estimate of the standard 
deviation of the population of 22-year-old men, known to be about 2.7 
inches, but it is also a rough estimate of the standard deviation of the 
sample. This process is valuable for two reasons: it serves as a check on 
computations of s , and it enables one to translate his common knowledge 
of range into the new quantity, standard deviation. 

It seems rather characteristic that large things vary much and small 
things little. For this reason it is often convenient to express the standard 
deviation as a fraction of the mean, the resulting statistic being called 
relative standard deviation or coefficient of variation, C. As an example, it is 
reported (1) that the average statures of one-year and eighteen-year 
girls are 74.4 and 161.0 cm. respectively, with standard deviations 2.64 
and 6.12 cm. The two coefficients of variation are 

Ci = 2.64/74.4 = 0.036 
Cm = 6.12/161.0 = 0.038, 

almost the same. Usually C is expressed as a percentage, C\ for example, 
being 3.6%. Discussion of this characteristic of the standard deviation is 
resumed in section 2.15. 

EXAMPLE 2.13 — The birth weights of 20 guinea pigs, borne in litters of two, were: 

30, 30, 26, 32, 30, 23, 29, 31, 36, 30, 25, 34, 32, 29, 28, 27, 38, 31, 34, 30 grams. 

In computing the deviations subtract, not the exact mean, but the nearest integer, 
30 grams. This will save squaring a lot of decimals. Is the sum of deviations zero? 
What is the mean of the deviations? How does your standard deviation compare 
with the correct value, 3.60 grams? What proportion of the birth weights lie in the 
interval 30.25 =*= 3.60 grams? 

EXAMPLE 2.14 — The standard deviation of the differences in yield, example 2.6, 
is 4.46 bushels per acre. For computation, carry only one decimal place in the mean, 
then compare your result with that just stated. For ordinary purposes it is quite all 
tight to retain in the mean only the precision of the observations. More precise results 
can be got with little extra labor by the methods of chapter 5. 
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EXAMPLE 2.15 — Would there be any object to calculating the mean and standard 
deviation of the 16 yields from treated seed in example 2.6? Would they be estimates 
of the parameters of any population? Suppose 10 plots had been harvested in 1933 
and 6 in 1934, would the estimates be altered? Would the populations be affected? 

EXAMPLE 2.16 — In samples of 15 to 17 items the range is, on the average, about 
3.5 times the standard deviation. Verify this statement from the sample of assays in 
table 2.1. How does it compare with the facts in example 2.6? The statement may be 
written in any of these three ways: 

range = 3.5r 
(range)/r =* 3.5 
(range)/3.5 = s 

For samples containing numbers of items different from 15 to 17, see section 5.7. 

EXAMPLE 2.17 — The mean yield of hay from 15 plots of alfalfa was 2.2 tons per 
acre, with s — 0.35 ton per acre. Would you estimate the range of the 15 plot yields 
as 1.2 tons per acre? Would this suggest that the highest yielding plot bore about 2.8 
tons per acre? 

EXAMPLE 2.18 — Suppose you lined up according to height the 16 men in two 
squads of 18-year-old freshmen, then measured the height of the shortest, 63 inches, and 
of the tallest, 72 inches. Would you expect the standard deviation of the 16 heights to 
be about 2 6 inches? Would you accept the midpoint of the range, 6 7 5 inches, as a 
rough average height? 

EXAMPLE 2.19 — In a normally distributed array of the gains in weight of 15 
swine during a 120-day feeding experiment, the mean gain was 180 pounds. Two- 
thirds of the gains fell within the interval from 145 to 215 pounds. Do you agree that 
the standard deviation was probably about 35 pounds? If so, would you estimate the 
least of the 15 gains at about 120 pounds? 

EXAMPLE 2.20— If you like puzzles, try making an approximately normal set 
of 16 items with £ = 20 and s =* 4. It is easier to start with a set of deviations. Test 
your result by counting the number of items in the interval 20 =*= 4, and by comparing 
the range with 3.5r. 

2.12 — The confidence or fiducial inference. After occupying the 
foregoing nine sections with concepts necessary to making this second 
inference about the population, we shall still have to anticipate some 
ideas that will be developed in the next chapter. This is in marked con- 
trast to the sequence in chapter 1, where the second inference followed 
immediately after the first (section 1.3). Looking forward, then, we bor- 
row a formula and a table (section 3.6 and table 3.8) which will enable 
us to set limits to the accuracy of the mean, 20 mg./lOO g., of the sample 
of vitamin C determinations. The formula, 


Si = s/y/n , 


enables us to estimate the standard deviation of means of samples drawn from 
some single population. Means are expected to vary less than items. The 
formula indicates this and gives the quantitative relation: the standard 
error , as Si is usually called, is inversely proportional to the square root of 
the sample size, n. For the vitamin C, 

Si = 3.98/\/l7 = 0.97 mg./lOO g., 


as recorded at the end of table 2.1. This says that if many sam* 
were drawn from the same population, and if the sta« 


(. * 

O 
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their means were calculated as in table 2.1, the answer is estimated from 
the single sample as 0.97 mg./lOO g. It is to be noted that means of 
samples of 17 vary about one-fourth as much as the items themselves. 

Turning now to table 3.8, the quantity t is a ratio defined by the 
formula, 

=*=/ = (x — m)/sz 

We know 2 and s± from our sample and can select an appropriate value 
of t from the table; therefore, we can calculate 

m = x =±= tSx 

Now, the sampling distribution of t , like that of x 2 > has been worked out 
so that one may select a value corresponding to many convenient proba- 
bilities. If we choose the 5% level, for example, and substitute it along 
with x and sz in the equation above, there will result 2 values of m which 
are called fiducial limits . Designating them as l\ and h , we have: 

l\ ~ x — t .06^ £ and I2 ~ x t.o^Sz 

Finally, if we say that the population mean, m , lies between these limits, 
the probability of being misled by the sampling is only 0.05. 

In the vitamin C sample, 2 = 20 and s t = 0.97 mg./lOO g.; for 16 
degrees of freedom, 1 05 = 2.120; hence, 

/1 = 20 — (2.120)(0.97) = 18 mg./lOO g., and 
h = 20 + (2.120)(0.97) = 22 mg./lOO g. 

We may assert, then, that in the sampled population the average vitamin 
C content is between 18 and 22 mg./lOO g., and we shall be right unless a 
one-in-twenty mischance has occurred in the sampling. 

Fisher (7) designates the interval from l\ to / 2 as the 0.95 fiducial 
interval. This corresponds to the 0.95 confidence interval of table 1.1, and 
the inference now made about the population mean is of the same general 
nature as that learned in chapter 1 . 

What are the theoretical conditions that must be met if the first two 
inferences are to be correct? Random sampling from a single, normally 
distributed population is one way to state them (in technical language, 
these are sufficient but not necessary conditions). It is hard to substitute 
any more general statement instead of the requirement of randomness, 
so let’s stick to that. But normal distribution of the population is a specifi- 
cation that can be considerably relaxed. This is fortunate, because we 
seldom know the distribution of sampled populations. From time to 
time in what follows, attention will be called to devices appropriate to 
samples from non-normal distributions. Otherwise, we shall assume that 
anormality affects our inferences but little. 

Are the theoretical conditions realized in the vitamin C sampling? 
The published report is not in sufficient detail to warrant judgment. Each 
manufacturer sent a specimen for analysis, but no information is given as 
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to the manner of selection. We assume random choice. As for normality, 
there is certainly no evidence against it in the sample (for tests of normality, 
see chapter 8). Finally, the requirement of sampling from a single popu- 
lation was probably not included in the plans. The vitamin C content of 
each brand may have had its own population mean and standard devia- 
tion, different from those of the others, and the quantities of juice marketed 
under the several brands probably differed. The design and statistical 
analysis of samples from such groups of populations will be discussed in 
chapter 17. At present we can say only this: our inferences about the 
population are exact if the theoretical conditions were met. As has been 
mentioned before, it is likely that all practical samplings depart more or 
less from the mathematical models. If the investigator plans his experi- 
ment with due regard to theory, then executes it skillfully, his inferences 
should be reliable approximations. 

EXAMPLE 2.21 — In examples 2.1 and 2.12 you calculated £ =* 11 grams and 
s ** 6.7 grams for the April weights of 12 staminate hemp plants. Assuming random 
sampling from a normal population of weights, compute the 95% fiducial interval 
as 6.7 — 15.3 grams. I hope you are impressed by the unreliability of this small sample 
with a standard deviation more than half the size of the mean (C * 61%). 

EXAMPLE 2.22 — In an investigation of growth in school children of 8 private 
schools (8), the mean height of 265 boys of age 13.5 to 14.5 years was 63.84 inches with 
standard deviation, 3.08 inches. What is the 95% fiducial interval of the mean of the 
population represented? Ans. 63.5 — 64.2 inches. Calculate C — 4.8%. 

EXAMPLE 2.23- — Set 99% fiducial limits to the population mean of the fore- 
going example. Ans. 63.4 — 64.3 inches. 

EXAMPLE 2.24 — The mean and standard deviation of the original assays in 
column 2 of table 2.1 are respectively 19.C and 3.95 mg./lOO g. What is the 95% 
fiducial intervaP Ans. 17 6 — 21 6 mg /1 00 g. Compare this with the interval, 
18 — 22 mg./lOO g., computed from the modified set in column 3 of the table. 

2.13— The comparison of individuals. Test of a null hypothesis. 

As pointed out in chapter 1, the investigator usually has definite questions 
about how his experimental material will behave under specific circum- 
stances. To get evidence, he sets up an experiment, more or less compli- 
cated. One of the simplest is planned to contrast the behaviors of two 
similar individuals subjected to different conditions. The individuals to 
be compared may be field plots or pigs or colonies of bees. But if there 
were only a single pair it would be impossible to say whether the difference 
in behavior is to be attributed to the conditions imposed or to the natural 
variability of the individuals or partly to each. Hence, there must be two 
or more pairs, or replications , one member of every pair being chosen at 
random to receive the first treatment, the other member the second. The 
differences between the measurements of the two members of the pairs 
constitute the sample data upon which inferences arc to be based. If there 
were no individual variation the differences would presumably be all 
alike. Actually, the variation among the differences is called experimental 
error. The outcome of the experiment is evaluated by comparing the aver- 
age difference, attributable to treatment, with the standard error of this 
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mean difference. Let us examine such an experiment together with the 
statistical method for reducing the data. 

Youden and Beale (19) wished to measure the difference in concentra- 
tion of two preparations of the mosaic virus. The method employed was 
to rub half a leaf of a tobacco plant with a piece of cheesecloth soaked in 
one preparation of the virus extract, and to rub the other half leaf similarly 
with the second preparation. The measurement of concentration is the 
number of local lesions appearing on the half leaf. The data reported in 
table 2.2 are measurements from leaf No. 2 on each of 8 plants. It is 

TABLE 2.2 

Numbers of Lesions on Halves of 8 Tobacco Leaves, Each Being Leaf No. 2 
on 8 Plants. The Lesions Measure the Concentrations of Two Preparations 

OF THE MoSIAC VlRUS* 


Plant No. 

Lesions on Half Leaves 

Difference 

X - Xi - A3 

Deviation 
* - X - £ 

Squared 

Deviation 

Preparation 1 Preparation 2 
A3 A3 

1 

9 

10 

- 1 

— 5 

25 

2 

17 

11 

6 

2 

4 

3 

31 

18 

13 

9 

81 

4 

18 

14 

4 

0 

0 

5 

7 

6 

1 

-3 

9 

6 

8 

7 

1 

— 3 

9 

7 

20 

17 

3 

-1 

1 

8 

10 

5 

5 

1 

1 

Total 

120 

88 

32 

0 

no 

Mean 

15 

11 

X = 4 lesions 


r 2 - 18 57 


s = a/ 18 57 - 4 31 lesions, j* 2 « 18 57/8 « 2 32. s £ - 1 52 lesions. 


* Slightly changed to make calculation easier. 


known that not only do individual tobacco plants react differently to the 
virus, but also leaves in different positions. Hence, each preparation is 
applied to every leaf, the halves tending to respond alike if treated alike. 
Thus, the difference in concentration of the two preparations is measured 
by the difference between the numbers of lesions as recorded in column 4 
of the table. These 8 independent measurements of the difference in 
concentration constitute a random sample from a population of such differ- 
ences, and can be summarized in exactly the same way as the sample of 
vitamin determinations in table 2.1. The mean difference, 4 lesions, is an 
estimate of the population mean, with s * = 1.52 lesions as the correspond- 
ing standard error. 

Remembering one object of the experiment, to learn whether there is a 
difference in concentration between the two preparations, we now set up 
the hypothesis that the population mean difference is zero, then ask, 
“What is the probability of a sample mean larger than 4 lesions in random 
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samples of 8 differences from the specified population?” To get this infor- 
mation the quantity, t, of table 3.8 is again invoked. Now, by hypothesis, 
m = 0, so 

/ - (x - 0 )/ Si * 4/1.52 = 2.63 

This sample value of / is to be compared with the tabular values opposite 7 
degrees of freedom. Since 2.63 lies about one-fourth the way from the 5% 
level to the 1%, the conclusion is that so large a value of t would occur 
in only 4 per 100 samples from the hypothetical population. As we 
learned in chapter 1, the null hypothesis is likely rejected, and the two 
preparations considered different in concentration. You see, the third 
inference about a population, described rather elaborately in section 1.6 
to 1.13, has now been adapted to measurement data. The sampling 
justification of the process, analogous to that of sections 1.9 to 1.12, is 
reserved for chapter 3. With the experience acquired in the first chapter, 
you have probably had no difficulty in following the condensed presenta- 
tion in this section. 

Having concluded that there is a population difference, you may wish 
to set bounds to it. We have, for d.f. — 7, /. 0 6 = 2.365, and hence 

h = 4 - (2.365) (1 .52) = 0.41 lesions, and 
h = 4 + (2.365) (1.52) = 7.59 lesions 

If wc say, then, that the difference in population concentration is measured 
by a number of lesions between 0.41 and 7.59, we shall be right unless a 
one-in- twenty chance has come off. 

EXAMPLE 2.25 — Using the statistics of examples 2 6 and 2 14 (x * 3.71 and 
s = 4 46 bushels per acre, with n = 16) calculate t <= 3.33 under the hypothesis 
that m -■ 0. Compare this with the 1 % level, / = 2 947. What conclusion do you draw? 

EXAMPLE 2.26 — L C. Grove (9) determined the mean numbers of florets pro- 
duced by seven pairs of plots of Excellence gladiolus, one plot of each pair planted with 
high (first year) corms, the other with low (second year or older) conns. The plot 
means were a* follows 


Corm 

Florets 

High 

Low 

11 2 

14 6 

13 3 

12 6 

12.8 13 7 

15 0 15.6 

12 2 

12 7 

11 9 

12 0 

12 1 

13 1 


Compute the differences, the mean difference, 1.2 florets, the standard deviation, 1 4 
florets, and t, 2.3, assuming m — 0 What is a reasonable inference about the popula- 
tion mean difference? 


EXAMPLE 2.27 — Mitchell, Burroughs and Beadles (11) computed the biological 
values of proteins from raw peanuts (P) and roasted peanuts (R) as determined in an 
experiment with 10 pairs of rats. The pairs of data P, R are as follows: 61, 55; 60, 54; 
56, 47; 63, 59; 56, 51; 63, 61; 59, 57; 56, 54; 44, 62; 61, 58. Compute the mean dif- 
ference 2.1, and the standard deviation of the differences 7.41 units. Since t = 0.90, 
approximately 40 out of 100 similar samples from a population with m == 0 would be 
expected to have larger lvalues. 

Note: 9 of the 10 differences, P — P, are positive. One would like some informa- 
tion about the next-to-the-last pair 44, 62. The first member, especially, seems abnormal. 
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While unusual individuals occur in the most carefully conducted trials, their appear- 
ance demands extra attention. You may have the curiosity to recalculate the statistics 
omitting this divergent pair. It always astonishes me that a single pair can reverse the 
conclusions. 

EXAMPLE 2.28 — Crampton (5) reports the gains in weight per 100 pounds of 
feed eaten for 10 pairs of swine constituting lots I and III of MacDonald College hog 
feeding trial 33B. The pigs were paired on the basis of being litter mates of as nearly 
the same initial weight as feasible. The pairs of gains are as follows, the datum for the 
member of lot I in each pair being set down first: 17, 20; 22, 21 ; 22, 21; 15, 22; 24, 24; 
22, 22; 21, 23; 21, 21; 17, 22, 21, 23 pounds gain per 100 pounds feed. Compute the 
mean difference, 1.7 pounds, in favor of lot III, and the standard deviation 2.7 pounds. 
Retain one decimal place in your deviation from mean if you wish to verify this stan- 
dard deviation. Calculate t, and observe the probability of occurrence of a larger if 
m * 0. Ans. / «* 2.0 and P = 0 08. 

EXAMPLE 2.29 — Prof. H. PI. Love, of Cornell University, took a leading part in 
introducing “Student’s” small sample theory in America. In one of his illustra- 
tions (10) he compared the yields of Great Northern ( G ) and Big Four ( B ) oats in the 
nine years 1912-1920. The differences, G — B, were 16.3, 13.4, 3.8, 7.9, 2.6, 2.5, 
9.6, 7.2, and 3.3 bushels per acre. Compute x — 7.4, and s — 5.0 bushels per acre. 
The difference in the yields of the two kinds of oats is highly significant. 

EXAMPLE 2.30 — Salmon (15) mentions a case in which 648 paired cooperative 
yield tests of Kanred and Turkey winter wheats were conducted in Kansas. It was 
desirable to make a rough test of significance without computing x and s. Instead, the 
number of tests in which Kanred outyielded Turkey was counted, 429, with six ties. 
Dividing the ties equally, it may be said that Kanred outyielded Turkey in the ratio 
of 432.216 tests. This ratio can be tested for significance against the theoretical 1:1, 
using the methods of chapter 1. The value, x 2 — *71, makes significance indubitable. 
Brandt (2) gives a graphical method for making the test. 

7 he reader may well omit the remainder of this chapter on the first reading. He may turn to 
chapter 3, or if he wishes to acquaint himself first with methods only , he may skip to chapter 4. 

2.14 — Sample size. By substituting m = 0 and s £ — s/y/n , the formula 
for t may be written 

t = xy/rt/s, 

from which, 

n = t 2 s 2 /x 2 or n ~ PC 2 , 

where C is the coefficient of variation defined in section 2.11. Thus for 
any specified probability of /, sample size may be estimated if any reason- 
able assumptions about s and x are available. 

As an illustration, refer to example 2.27 where statistics for the bio- 
logical value of proteins from raw peanuts were x — 2.1, s = 7.4, and 
consequently C = 7.4/2. 1 — 3.5 (that is, 350%). If one assumes that 
approximately this same ratio will eventuate in a larger sample, he may 
determine the number of pairs of rats necessary to produce t = 2, 


n = 2 2 (3.5) 2 = 49 


If this larger sample is drawn, it may turn out to have a value of C either 
greater or less than 3.5, with t correspondingly either beyond the 1% 
level or less than the 5%. One should be prepared to reject the null 
hypothesis or to accept it according to the evidence of the new sample. If 
the decision is important enough, the substituted value of t 2 may be 
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increased to 7, corresponding roughly to the 1% level. However, it 
seems a good guess that the unusual pair, 44, 62, would seldom appear 
again, and that even 10 pairs would ordinarily indicate a real population 
difference. 

A moment’s thought will make it clear that sampling might be con- 
tinued until an unusually large value of t is turned up even if m is really 
zero. If one quits when the results tally with his wishes, he modifies the 
probabilities set down in table 3.8. He might even be misled by a second 
sample designed as above, because this larger sample might be the freakish 
one. 

It may be well to emphasize again the idea that statistical evidence 
is not proof. Even after extensive sampling the investigator may not 
reject the null hypothesis when in fact the hypothesis is false. For example, 
m may not be zero in the population, yet natural variation may be great 
enough to confine t to a moderate value in any practicable size of sample. 
Therefore, when one fails to reject the hypothesis he does not thereby 
conclude that m is zero. He decides only that m is so small as to be un- 
important to his investigation. 

2.15 — The relative variation. Coefficient of variation. In section 
2.11, C was defined as s/x. As was indicated, its utility lies mainly in the 
fact that in many series the mean and standard deviation tend to change 
together. This is illustrated by the mean stature and corresponding 
standard deviation of girls from 1 to 1 8 years of age shown graphically in 
figure 2.2. Until the twelfth year the standard deviation increases at 
somewhat greater rate, relative to its mean, than does stature, causing 
the coefficient of variation to rise, but by the seventeenth year and there- 
after C is back to where it started. Without serious discrepancy one may 
fix in mind the figure, C — 3.75%, as the relative standard deviation of 
human stature, male as well as female. More precisely, the coefficient 
rises rather steadily from infancy through puberty, falls sharply during a 
brief period of uniformity, then takes on its permanent value near 3.75%. 

A knowledge of relative variation is valuable in planning and in evalu- 
ating experiments. In the preceding section was an example of the utility 
of C in estimating sample size. After the statistics of an experiment are 
summarized, one may judge of its success partly by looking at C. In 
corn variety trials, for example, although mean yield and standard devia- 
tion vary with location and season, yet the coefficient of variation is often 
between 5% and 15%. Values outside of this interval cause the investi- 
gator to wonder if an error has been made in calculation, or if some 
unusual circumstances throw doubt on the validity of the experiment. 
Similarly, each sampler knows what values of C may be expected in his 
own data, and is suspicious of any great deviation. 

Other uses of the coefficient of variation are numerous but less preva- 
lent. Since C is the ratio of two averages having the same unit of measure- 
ment it is itself independent of the unit employed. Thus, it is the same 
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5 



w. 


Fig. 2.2 — Graph of 3 time series; stature, standard deviation and coefficient of variation 
of girls from 1 to 18 years of age. See reference (1). 


whether inches, feet or centimeters are used to measure height. Also, 
the coefficient of variation of the yield of hay is comparable to that of the 
yield of corn. 

Experimental animals have characteristic coefficients of variation, and 
these may be compared despite the diversity of the variables measured. 
The body weight of male albino rats between 90 and 243 days of age has 
an average C of about 14% (6), while that for annual egg production of 
Barred Plymouth Rock hens is close to 32% (13). Such information is 
essential to those planning experiments. 

Like many other ratios, the coefficient of variation is so convenient 
to use that some people overlook the information contained in the original 
data. Try to imagine how limited you would be in interpreting the stature- 
of-girls coefficients if they were not accompanied by x and s. You would 
not know whether an increase in C is due to a rising s or a falling x , nor 
whether the saw-tooth appearance of the C-curve results from irregulari- 
ties in one or both of the others, unless indeed you could supply the facts 
from your own fund of knowledge. The coefficient is informative and use- 
ful in the presence of x and r, but abstracted from them it may be mis- 
leading. 

EXAMPLE 2.31 — In example 2.28 the mean difference between the gains of ten 
pairs of pigs was 1.7 pounds with standard deviation, 2.7 pounds. Assuming the same 
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coefficient of variation in a following experiment, how many pairs should be included 
to bring t to its 5% level? Ans. Approximately 13 pairs. 

EXAMPLE 2.32 — In experiments involving chlorophyll determinations in pine- 
apple plants (18), the question was raised as to the method that would give the most 
consistent results. Three bases of measurement were tried, each involving 12-leaf 
samples, with the statistics reported below. From the coefficients of variation, it was 
decided that the methods were equally reliable, and that the most convenient one 
could be chosen with no sacrifice of precision. 


Statistics of Chlorophyll Determinations on 12-Leaf Samples From Pineapple 
Plants, Using Three Bases of Measurement 


Statistic 


Mean (milligrams) 

Standard deviation (milligrams) 
Coefficient of variation (percent) 


100-gram 
Wet Basis 

100-gram 
Dry Basis 

100-sq. cm. 
Basis 

61 4 

337 

13.71 

5.22 

31 2 

1.20 

8 5 

9.3 

8 8 


EXAMPLE 2.33 — In well-conducted experiments with properly developed and 
adequately fed growing swine, the standard deviation of gains in weight may be taken 
as 10% of the mean gain. If you are told that a lot of pigs gained 150 pounds during 
a certain period, would you expect the standard deviation of the gains to be about 1 5 
pounds? Would it be equally true that an average daily gain of 1.5 pounds per day 
should be accompanied by a standard deviation of approximately 0.15 pound per day? 

EXAMPLE 2 34 — In a certain laboratory there is a colony of rats in which the 
coefficient of variation of the weights of males between 56 and 84 days of age is close to 
13%. Estimate the standard deviation of the weights of a lot of these rats whose mean 
weight is 200 grams. Ans. 26 grams. 

EXAMPLE 2.35 — Assuming that the coefficient of variation of yields in field plot 
tests with wheat is usually near 5%, would you be surprised if told that in an experi- 
ment where the yield was 25 bushels per acre, the standard deviation of plot yields was 
0.5 bushel per acre? 

EXAMPLE 2 36 — In example 2.17, the coefficient of variation of the plot yields is 
16%. Do you know enough about alfalfa to judge whether that is a reasonable figure? 

2.16 — Probable deviation. This is an average of variation, widely 
used in the past but now rapidly being replaced by the standard deviation. 
In size, it is approximately two-thirds of the standard deviation. As for 
computation, one calculates the standard deviation first, as in table 2.1, 
then multiplies by 0.6745 to get the probable deviation. The reason for 
the factor, 0.6745, (or approximately, %) will be set out in chapter 8. 

What are the advantages of the probable deviation, p.d ., over the 
standard? Only one. In a normally distributed population, the interval, 
m =*= p.d., contains half the items. It is perhaps easier to think of one-half 
than of the 0.6827 associated with the interval m =*= a (section 2.11). 
This advantage is so far overbalanced by the many superiorities of the 
standard deviation as to make it seem likely that the probable deviation 
will disappear from use in biological statistics. 

In random samples drawn even from normally distributed populations 
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the proportion of items within the interval, x =*= p.d ., varies considerably. 
Examine the heights of men in example 2.2. Since s = 3 inches the 
probable deviation is 0.6745 X 3 = 2.02 inches. The interval, 67 =±= 2.02 
inches extends from 64.98 to 69.02 inches. Within this interval lie the 7 
heights, 65, 66, 67, 67, 68, 68, 69, constituting 64% of the entire sample 
number. Contrariwise, only seven of the sixteen differences in example 2.6 
lie in the interval, 3.7 =±= 3.0 bushels per acre (see example 2.14). How- 
ever, in larger samples drawn from normally distributed populations, one 
often finds approximately 50% of the items in the specified interval. 

It is necessary to become familiar with the probable deviation in order 
to read technical articles involving statistics. Remember, probable devia- 
tion is only about % of standard; while standard deviation, the larger, 
is 1.5 times the probable. 


2.17 — The median. The mean is the average of concentration pecul- 
iarly appropriate for describing samples from normally distributed popu- 
lations. When populations depart notably from the normal, the mean 
may not be at all typical. In such cases, another average of concentra- 
tion, the median , may be used for description, supplementing the mean. 
The median is the middle item in an array. 

As an example of its utility in description, look back at the weights of 
11 forty-year-old men listed in example 2.4. Contrast the graph of this 
array, figure 2.3, with that of the vitamin data of figure 2.1. Note that 
only three of the weights exceed the mean, 172 pounds. The median, 162 
pounds, is the sixth weight in the array, five weights being larger and five 
smaller. In a sense, the median describes the set of weights in better 
fashion than does the mean. Together, they give a fairly adequate account 
of the central tendency of the weights in this sample. The difference, 
(median) — (mean), is sometimes used as a rough measure of the skew- 
ness of a sample. The disability of the mean, alone, as an average of 
skewed sets is that it is unduly influenced by the few extreme items. 

Weights of mature individuals are usually distributed in asymmetrical 
fashion, the majority of them lying near the lower end of the array. 
Apparently, individuals find it difficult to survive with weights much lower 
than normal, but may carry far greater weights without immediate penalty. 
Skewed distributions result from any limit of variation at one end of the 
range. For this reason, dressing percentages of swine are apt to be con- 
centrated in the upper part of the array. Asymmetrical distributions 
constitute a difficult problem in statistics. We shall avoid them as much 
as possible until we reach a later chapter. 

If the median is used as the average of concentration, it is customary 
to describe the variation of a set by fixing two other values called quartiles 
Just as the median is the item at the middle of an array, so the quartiles 
lie at the middle of the lower and upper halves of the array. In the array 
of men’s weights, the third weight, 158 pounds, is the first quartile, Qi, 
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Fig. 2.3 — Weights of 11 forty-year-old men. The break attracts attention to the fact 
that the differences in weight are exaggerated in order to emphasize the asymmetry 

of the array. 


while the ninth, 182 pounds, is the third quartile, Q 3 . The median, then, 
might be called the second quartile, Q 2 . The inter -quartile interval, Qs 
— Q u is a measure of variation. Like the interval, x =*= pd. in the normal 
distribution, the interval Q 3 — Qi in every distribution contains 50% 
of the items. In a normally distributed sample, therefore, Qs — Qi = 2 pd. 

In a sample like that of example 2.39, there are no items exactly 
dividing the set into fourths. The median is then defined as the inter- 
polated value of item number (n + l)/2, which, in this case, is item num- 
ber (24 + l)/2 = 123^. The interpolated value is the mean of the two 
items on either side of this one, the twelfth and thirteenth. Its value is, 
then, (19.9 + 20.1)/2 = 20, the same as the mean. Similarly, Qi is the 
interpolated value of item number (n + l)/4 = 6%. This value divides 
the interval between the sixth and seventh items in the ratio, 1:3. Qi is 
therefore a weighted mean of these items, [3(15.3) + 16.3]/4 = 15.55. 
For the item number is 3(rz + l)/4 = 18%, and the interpolated 
value, 

[23.7 + 3(24.7)]/4 = 24.45 

The inter-quartile interval is 24.45 — 15.55 = 8.9, which is about as 
near to 2 pd. = 8.4 as one ordinarily gets in these approximately normal 
samples. 

The median-quartile description of a sample falls into the same cate- 
gory as the range, furnishing a rough-and-ready means of summarizing 
the data. So far as more advanced statistical methods are concerned, it 
is not often useful. 

EXAMPLE 2.37 — Verify the fact that the interval, # pd. t contains about half 
the values of X in table 2.1. 
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EXAMPLE 2.38 — The mean height of 840 thirteen-year-old boys is reported as 
57.3 inches with probable deviation 1.86 inches. Would you expect to find about 420 
of these heights between 55.4 and 59.2 inches? Compute s = 2.76 inches and C 
“ 4.8%. 

EXAMPLE 2.39 — The following approximately normal sample has the mean, 
20.0: 7.7, 9.9, 11.6, 13,0, 14.2, 15.3, 16.3, 17.2, 18.0, 18.8, 19.5, 19.9, 20.1, 20.5, 
21.2, 22.0, 22.8, 23.7, 24.7, 25.8, 27.0, 28.4, 30.1, 32.3. Estimate the probable deviation 
by measuring the interval, equally spaced on either side of the mean, within which 
half the items lie. Would you begin measuring from 16.3 or from 15.8? Using the 
method of table 2.1, s — 6.3, and thence, probable deviation = 4.2. Do two-thirds of 
the items lie within the interval, 20 =*= 6.3? 

EXAMPLE 2.40 — Try writing a normal sample of 16 items with mean 50 and 
probable deviation 10. Don’t forget example 2.16. Establish your success by using 
the method of computation in table 2.1. 

EXAMPLE 2.41— Show that, of the differences in example 2.6, Q\ = —0.30, 
Qt = 3.85, and Qj ** 6.92 bushels per acre. 

EXAMPLE 2.42 — In the sample of 11 weights of example 2.4 and section 2.16, 
would you expect to find Qi — Qi =» 2 p.d.? It is easy to test your expectations. 

EXAMPLE 2.43 — The weights of neutral fats in the blood plasma of 64 normal men 
(12) were 419, 162, 149, 219, 248, 313, 211, 169, 91, 281, 264, 172, 124, 235, 94, 62, 
224, 58, 92, 205, 132, 145, 305, 285, 174, 107, 240, 269, 396, 416, 662, 703, 249, 179, 
136, 157, 198, 95, 100, 178, 145, 199, 54, 407, 166, 94, 248, 235, 66, 120, 239, 128, 560, 
233, 80, 557, 217, 542, 252, 175, 103, 165, 351, 107 mg. per 100 cc. Array these weights 
and represent them graphically. Calculate the mean, 224.4 mg. per 100 cc. The 
median is 188.5 mg. per cc. Can you give an explanation of the skewness of the dis- 
tribution? 

EXAMPLE 2.44 — Calculate the deviations from the median for the 1 1 weights of 
40-year-old men. Is the sum equal to zero? Is the sum of the deviations from the mean 
equal to zero? 

EXAMPLE 2.45 — If you ignore the signs of the deviations in the preceding example, 
using only their absolute values, which is the larger, the sum of the absolute deviations 
from mean or median? 

EXAMPLE 2.46 — Clements and Long (3), in discussing the effects of light on 
the growth of sunflower plants, said that the difference in stature produced by lights 
in various intensities ranged from 14.8 to 32.0 cm., with mean equal to 23.3 cm. The 
midpoint of the range, (14.8 -f 32.0) /2 = 23.4 cm., is almost the same as the mean. Do 
you think this is convincing evidence of symmetry in the heights of the sunflower 
plants? 

EXAMPLE 2.47 — Observe that in figure 2.3 the midpoint of the range is^l92 
pounds greater than either median or mean. Why? 

2.18 — Description of a sample. Summary. The entire information 
in a sample from a normally distributed population is summarized by the 
three facts: number of observations, mean, and standard deviation. 
Little is gained by reporting the individual items in either table or graph. 
A statement of the fiducial interval may be helpful to the reader, as well 
as the probability associated with whatever hypothesis has been set up. 
One advantage of these summary numbers is that all the relevant informa- 
tion from the sample is conveyed briefly, leaving time and space for 
discussion of the biological implications. 

In this text, attention has been focused chiefly on inferences about the 
population from which the sample was taken. The mean and variance of 
a random sample from a normal population as unbiased estimates of the 
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corresponding parameters, the fiducial interval as an indication of the 
reliability of the sampling, and the test of some appropriate hypothesis — 
these are the inductions which can be made from sample to population. 

The replicated comparison of individuals is a simple and effective 
experimental design for testing the difference between two treatments. 
The two members of a pair would presumably behave alike if treated 
alike, and the difference between their behavior measures the differential 
treatment effect. Replication tends to average out variation if the indi- 
viduals of each pair are randomly assigned to the treatments. The varia- 
tion among the pair differences is a measure of the experimental error 
inherent in such a trial. 

The natural extension to replicated sets of individuals, each assigned 
to one of three or more treatments, will be made in chapter 11. Mean- 
while, we must pause to develop the theory which we have already found 
essential for both fiducial statements and tests of hypotheses. 

REFERENCES 

1. Bernice Boynton. University of Iowa Studies in Child Welfare, Vol. 12, No. 4 

(1936). 

2. A. E. Brandt. Journal of the American Statistical Association, 28:434 (1933). 

3. Frederic E. Clements and Frances C. Long. Plant Physiology, 9:779 (1934). 

4. Council on Foods. Journal of the American Medical Association, 1 10.651 (1938). 

5. Earl Wilcox Crampton. Journal of Nutrition, 7.305 (1934). 

6. Henry H. Donaldson. The Rat. The Wistar Institute of Anatomy and Biology, 

No 6. Philadelphia (1924). 

7. R. A. Fisher. Proceedings of the Royal Society of London, A, 139:343 (1933). 

Proceedings of the Cambridge Philosophical Society, 26*528 (1930). 

8. Horace Gray and J. G. Ayres. Growth in Private School Children. The Uni- 

versity of Chicago Press, Chicago (1931). 

9. L. C. Grove. Iowa Agricultural Experiment Station Bulletin 253 (1939). 

10. H. H. Love. Journal of the American Society of Agronomy, 16:64 (1924). 

11. H. H Mitchell, Wise Burroughs and Jesse R. Beadles. Journal of Nutrition, 

11:257 (1936). 

12. Irvine H. Page, Esben Kirk, William H. Lewis, Jr., William Thompson and 

Donald D. Van Slyke. The Journal of Biological Chemistry, 111:613 (1935). 

13. Raymond Pearl and F. M. Surface, USD A, Bureau of Animal Industry Bulletin, 

No. 110, Part I (1909). 

14. Chas. S. Reddy. Iowa State College Journal of Science, 9:530 (1935). 

15. S. C. Salmon. The Journal of the American Society of Agronomy, 22:77 (1930). 

16. -‘Student.” Biometrika, 6:1 (1908). 

17. Paul J. Talley. Plant Physiology, 9:737 (1934). 

18. R. K. Tam and O. C. Magistad. Plant Physiology, 10*161 (1935). 

19. W. J. Youden and Helen Purdy Beale. Contributions from Boyce Thompson 

Institute, 6:437 (1934). 



Chapter 3 

SAMPLING FROM A NORMALLY DISTRIBUTED POPULATION 


3.1 — Introduction. In chapter 1 the facts about confidence intervals 
were verified through sampling and this same device outlined the chi- 
square distribution which led to desired probabilities of occurrence. 
But in chapter 2 you were asked to accept S£ and t with almost no justifi- 
cation. Now we are ready to examine their distributions along with those 
of x> s 2 , and s> in the expectation of learning the reasons for the statements 
already made about them. 

There are two devices for studying the sampling properties of statistics. 
The first is that employed by the mathematician who arrives at the 
results through use of his symbolical logic. He furnishes the formulas and 
tables that make statistical methods accurate and convenient. The second 
device is the actual drawing of samples. This requires no mathematical 
training but makes it possible to verify many findings of probability 
theory. More important, the layman is enabled by his experience in 
sampling to appreciate the foundations upon which the mathematical 
logic is built. 

Both the mathematician and the experimenter have been eager to 
draw samples from actual populations which depart more or less from the 
mathematical models. The data may come from harvesting a crop in 
small plots, with perhaps a thousand measurements of some such character 
as yield (1) (5) (6). The geographical distribution of the variable is then 
known as well as the frequency distribution. The latter may be compared 
with the normal distribution, often assumed to be the source of samples, 
while the former is the basis for studying the effectiveness of various ex- 
perimental designs. 

For our purposes, we shall, as before, set up a particular kind of 
distribution and draw samples from it with the help of the table of random 
digits. But now, instead of having only two kinds of individuals to deal 
with, as in the enumeration statistics which have been studied, we are 
concerned with a population whose members theoretically have all values 
of some variate like weight. In this model distribution there is no limit 
to the number of different weights, but practically the scale for measuring 
can be read no closer than a certain least count. Hence, the sample that 
we have devised to simulate a normal population departs from the model 
in two respects; it is limited in size and range instead of being infinite, 
and has a discontinuous variate in lieu of the continuous one implied in the 
theory. The effects of these departures will scarcely be noticed because 
they are small in comparison with sampling variation. 

[ 54 ] 
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3.2 — A normally distributed sample. In table 3.1 are the weight 
gains of a hundred swine, slightly modified from experimental data so 
as to form a distribution which is approximately normal. The items are 
numbered from 00 to 99 in order that they may be identified easily with 
corresponding numbers taken from the table of random digits. The salient 
features of this kind of distribution may be discerned in figure 3.1. The 

TABLE 3.1 

Array of Gains in Weight (Pounds) of 100 Swine During a Period of 20 Days 
The gams approximate a normal distribution with 
m =* 30 pounds and <r = 10 pounds 


Item 


Item 


Item 


Item 


Number 

Gain 

Number 

Gain 

Number 

Gain 

Number 

Gain 

00 

3 

25 

24 

50 

30 

75 

37 

01 

7 

26 

24 

51 

30 

76 

37 

02 

11 

27 

24 

52 

30 

77 

38 

03 

12 

28 

25 

53 

30 

78 

38 

04 

13 

29 

25 

54 

30 

79 

39 

05 

14 

30 

25 

55 

31 

80 

39 

06 

15 

31 

26 

56 

31 

81 

39 

07 

16 

32 

26 

57 

31 

82 

40 

08 

17 

33 

26 

58 

31 

83 

40 

09 

17 

34 

26 

59 

32 

84 

41 

10 

18 

35 

27 

60 

32 

85 

41 

11 

18 

36 

27 

61 

33 

86 

41 

12 

18 

37 

27 

62 

33 

87 

42 

13 

19 

38 

28 

63 

33 

88 

42 

14 

19 

‘ 39 

28 

64 

33 

89 

42 

15 

19 

40 

28 

65 

33 

90 

43 

16 

20 

41 

29 

66 

34 

91 

43 

17 

20 

42 

29 

67 

34 

92 

44 

18 

21 

1 43 

29 

68 

34 

93 

45 

19 

21 

44 

29 

69 

35 

94 

46 

20 

21 

[ 45 

30 

70 

35 

95 

47 

21 

22 

| 46 

30 

71 

35 

96 

48 

22 

22 

47 

30 

72 

36 

97 

49 

23 

23 

48 

30 

73 

36 

98 

53 

24 

23 

49 

i 

30 

74 

36 

99 

57 


gains, clustering at the midpoint of the array, thin out symmetrically, 
slowly at first, then more and more rapidly. Two-thirds of the gains lie in 
the interval 30 =*= 10 pounds, that is, in an interval of two standard devia- 
tions centered on the mean. In a real population, indefinitely great in 
number of individuals, greater extremes doubtless would exist, but that 
need cause us little concern. 

The relation of the histogram to the array is clear. After the class 
bounds are decided upon, it is necessary merely to count the dots lying 
between the vertical lines, then make the height of the rectangle propor- 
tional to their number. The central value, or class mark , of each interval 
is indicated on the horizontal scale of gains. 

In table 3.2 is the frequency distribution graphically represented in 
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GAIN IN POUNDS 


Fig. 3.1 — Upper part: Graphical representation of array of 100 normally distributed 
gains in weight. Lower part: Histogram of same gains. The altitude of a rectangle in 
the histogram is proportional to the number of dots in the array which lie between 

the vertical sides. 
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figure 3.1. Only the class marks are entered in the first row. The class 
intervals are from 2.5 to 7.5, etc. This is a notably concise method of 
presenting the data in large samples. 

TABLE 3.2 

Frequency Distribution of Gains in Weight of 100 Swine 
A normally distributed sample 


Class mark (pounds) 5 10 15 20 25 30 35 40 45 50 55 

Frequency 2 2 6 13 15 23 16 13 6 2 2 


3.3 — Random samples from a normal distribution. An easy way to 
draw random samples from the table of pig gains is to take numbers con- 
secutively from the table of random numbers, table 1 .2, then match them 
with the gains by means of the integers, 00 to 99, in table 3.1. To avoid 
duplicating the samples of others, start at some randomly selected point 
in the table of random numbers instead of at the beginning, then proceed 
upwards, downwards, or crosswise. Suppose you have hit upon the digit, 
8, in row 71, column 29. This with the following digit, 3, specifies pig 
number 83 in table 3.1, a pig whose gain is 40 pounds. Hence, 40 pounds 
is the first number of the sample. Moving upwards among the random 
numbers you read the integers 09, 75, 90, etc., and record the correspond- 
ing gains from table 3.1, 17, 37, and 43 pounds. Continuing, you get as 
many gains and as many samples as you wish. 

Samples of 10 arc suggested. Unlike the procedure in chapter 1, for 
our present purposes the samples must all be of the same size because the 
distributions of their statistics change with n. It is well to record the items 
in columns, leaving a half dozen lines below each for subsequent computa- 
tions. For your guidance, four samples are listed in table 3.3. The com- 
putations below them will be explained as we go along. Draw as many of 
the samples as you think you can process with the time at your command. 
It is fortunate if several are working together so that the results of each 
can be made available to all. Keep the records carefully because you 
will need them again and again. 

It will be noted that each pig gain may be drawn as often as its number 
appears in the table of random digits — it is not withdrawn from circula- 
tion after being taken once. Thus, the sampling is always from the same 
population, and the probability of drawing any particular item is prac- 
tically constant throughout the process. 

EXAMPLE 3.1 — Determine the range m each of your samples of 10. The mean of 
these ranges is an estimate of 3 a = 30 pounds, table 5.5. Does your average range 
approximate 30 pounds? 

3.4 — The distribution of sample means. First add the items in each 
sample, then put down the mean, x (division is by 10). While each of 
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TABLE 3.3 

Four Samples op 10 Items Drawn at Random From the Pig Gains op Table 3.1, 
Each Followed by Statistics To Be Explained in Sections 3.4-3.8 


Item Number 

Sample Number 





and Formulas 

1 

2 

3 

4 

1 

33 

32 

39 

17 

2 

53 

31 

34 

22 

3 

34 

11 

33 

20 

4 

29 

30 

33 

19 

5 

39 

19 

33 

3 

6 

57 

24 

39 

21 

7 

12 

53 

36 

3 

8 

24 

44 

32 

25 

9 

39 

19 

32 

40 

10 

36 

30 

30 

21 

£ 

35 6 

29 3 

34 1 

19 1 

r 2 

169 8 

151 6 

9 0 

112 3 

s 

13 0 

12 31 

3 00 

10 6 

Si 

4 11 

3 89 

0 95 

3 35 

t 

1.36 

-0 18 

4 32 

- 3.25 

t 06 Si 

9.3 

8 8 

2 2 

7 6 

X ^ t 06-J i 

26 3 - 44.9 

20 5-38 1 

31 . 9-36 3 

11 5-26 7 


these means is an unbiased estimate of the parameter, 30 pounds, yet 
there is great variation among them. Make an array of the means of all 
your samples. If there are enough of them, group them into a frequency 
distribution like table 3.4. 

As indicated, our laboratory means ranged from 19 to 39 pounds, 
perhaps to the novice a disconcerting variability. Try to imagine doing 
an experiment resulting in one of these more divergent mean gains instead 
of the correct value, 30 pounds. Having no information about the popu- 
lation except that furnished by the sample, you would be considerably 
misled and there is no way to overcome this hazard. One of the objects 
of the experimental samplings you have done is to acquaint you with the 
risks involved in all conclusions based on small portions of the aggregate. 
The investigator seldom knows the parameters of the sampled popula- 
tion: he knows only the sample estimates. He learns to view his experi- 
mental data in the light of his experience of sampling error. His judg- 
ments involve not only the facts of his sample but all the related informa- 
tion which he and others have accumulated. 

The more optimistic are prone to draw satisfaction from the large 
ntimber of means near the center of the distribution. If this were not 
characteristic, sampling would not be so useful and popular. The im- 
probability of getting poor estimates produces a sense of security in making 
inferences. This is proper. With the right balance between confidence 
and prudence the experienced investigator is seldom a victim of the acci- 
dents of sampling. 
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TABLE 3.4 

Frequency Distribution of 511 Means of Samples of 10 Items Randomly Drawn 
From the Pio Gains of Table 3.1 . Theoretical Normal Frequency for Comparison 


Class Mark (pounds) 

Frequency 

Theoretical Frequency 
(For computation, 
see section 8.8) 

19 

1 

0.20 

20 

1 

0 41 

21 

0 

1.18 

22 

7 

2.71 

23 

5 

5.62 

24 

10 

10.78 

25 

19 

18.60 

26 

30 

29 02 

27 

41 

41 14 

28 

48 

52.07 

29 

66 

61.63 

30 

72 

64 23 

31 

56 

61.63 

32 

46 

52.07 

33 

45 

41.14 

34 

22 

29 02 

35 

24 

18.60 

36 

12 

10.78 

37 

5 

5 62 

38 

0 

2.71 

39 

1 

1 84* 

Total 

511 

511 00 


* Including subsequent classes. 


The distribution of means in table 3.4 brings to attention two more 
basic features of the sampling. First, the distribution is itself approxi- 
mately normal, as you may see by comparing the actual with the theo- 
retical normal frequencies. The theory is that means of samples drawn 
from a normal population are likewise normally distributed. We may go 
further and say that means tend to be distributed normally even if the 
parent population is considerably anormal, a great convenience in practi- 
cal applications where the exact form of the sampled population is usually 
unknown. The second feature to be noticed is that the range of the means, 
great as it is, is only about one-third that of the population of gains in 
table 3.1. Means are less variable than items, as we have had occasion 
to remark before. Discussion of this second feature will be resumed in 
section 3.6 below. 

The final characteristic to be noted in the distribution is its mean. 
This may be calculated by adding all the sample means and dividing 
by 511, but a shorter method will be explained in section 8.2. The value, 
29.87 pounds, is an unbiased estimate of m = 30 pounds. Clearly, if 
one could command large samples such as this he wouldn’t bother much 
about sampling variation. Also, this close approximation throws light 
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on the meaning of the term, unbiased. As the extent of the sampling has 
increased, the deviation of the sample mean from the true value has 
become very small. You remember that an unbiased sample is one 
whose mean tends toward the population value as n is increased. 

3.5 — Distribution of variance and standard deviation. s 2 y the 

average square of the deviations in each of your samples, may be com- 
puted as in table 2.1. Shorter methods are explained in sections 5.1 and 
5.2, the latter being especially easy if a calculating machine is available. 
In table 3.3 you will observe that three of these variances overestimate the 
population value, 100, while the fourth is notably small. Examine any of 
your samples with unusual variances to learn what peculiarities of the 
arrays are indicated. This freakish sample 3 in the table has a range of 
only 39 — 30 = 9 pounds with not a single item less than the population 
mean. Did you happen to get a sample with range equal to that of the 
population? 

The distribution of the variances of our 511 samples is displayed in 
table 3.5. Notice its skewness, with the clustering below the mean and a 


TABLE 3.5 

Frequency Distribution of 511 Variances of Samples With n = 10 Drawn From 
the Normal Distribution of Table 3.1 


Class mark 

0 20 40 60 80 100 120 140 160 180 200 220 240 260 280 300 320 340 

Frequency 

1 11 47 92 93 72 73 42 29 26 11 8 2 1 0 1 1 1 


long tail above — ^somewhat like the chi-square distribution of chapter 1, 
though less extreme. Despite this, the mean of these variances is 101.5, 
closely approximating the population value of 100 and verifying the fact 
that the sample variance is an unbiased estimate. 

You may think that we should go ahead with fiducial limits and tests 
of significance based on the variance distribution. Variance is a measure 
of dispersion similar to chi-square, and might be used in much the same 
fashion were it not that the more convenient t has been discovered. This 
will be discussed in section 3.7. 

Our distribution of s , the square root of the variance, in table 3.6 
conforms closely to theory. It shows a slight skewness (not so large as 

TABLE 3.6 


Frequency Distribution of 511 Standard Deviations Corresponding to the 

Variances of Table 3.5 


Class mark 

3 

4 

5 6 7 

8 

9 

10 

11 

12 

13 

14 15 

16 17 18 

Frequency 

1 

2 

9 18 58 

77 

80 

71 

79 

44 

41 

17 8 

3 1 2 
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that of s 2 ) as well as a small bias with mean 9.8 pounds, slightly less than 
a = 10 pounds. Thus, even in samples so small as 10 the bias of the 
standard deviation is negligible in single samples. 

EXAMPLE 3.2 — Theoretically, 50% of the means in table 3.4 should be smaller 
than 30 pounds. Assuming that half of the means in the interval with class mark 30 
pounds have values less than 30, the sample number of means less than 30 pounds is 
264, the remaining 247 being greater. Calculate adjusted chi-square on the hypothesis 
that in the population the distribution is 1:1. Ans. 0.50. 

EXAMPLE 3.3 — If half the standard deviations of table 3.6 were expected to be 
less than 10 pounds, calculate adjusted x 2 =* 4.70 for the sample. This is evidence 
against a symmetrical distribution in the population. 

3.6 — Standard deviation of the mean, or standard error. This 
quantity, one of the most useful in our kit of tools, is calculated from either 

Si = s/y/n or Si = \A 2 /n, 

whichever is more convenient. In each of your samples of 10, s x is an 
estimate of 

a x = a/\/n = 10/\/l0 = 3.162 pounds 

In your drawings you will experience the usual sampling variation, but 
the average of your standard errors should approximate the theoretical 
3.162 pounds. Since Si is slightly biased in the same manner as s , you 
would get a still more biased average if you calculated the mean of your 
standard errors. To get the most accurate estimate, add your unbiased 
variances of the means, 

Si 2 = s 2 /n , 

then extract the square root of their mean. In our 511 samples, for 
example, this average is easily got by taking the average of the variances, 
101.5, dividing by 10 and extracting the root, -%/ 10.15 = 3.186 pounds. 
This close approximation to the population value constitutes sampling 
evidence that s x is an estimate of cr x . But what is the nature of this a x of 
which Si is an estimate? As indicated in section 2.12, a x is the standard 
deviation of means of samples of n drawn at random from a normal popu- 
lation with standard deviation, <r. Verification of this theorem is found 
in the distribution of 511 means in table 3.4. Their standard deviation 
turns out to be 3.178 pounds (in section 8.3 is explained a way of calcu- 
lating this easier than that of table 2.1), a satisfactory approach to the 
theoretical 3.162 pounds. If we now combine these two pieces of informa- 
tion, we are prepared for the statement that each random sample affords 
an estimate, s *, of the standard deviation of means, or standard error, cr*. 
Ordinarily only the single sample is drawn. You have seen from examina- 
tion of your small samples that the estimate may be disconcertingly di- 
vergent, but we have now learned that it is only slightly biased in even 
small samples of 10. We are next to learn how it leads to fiducial state- 
ments and tests of hypotheses. 
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EXAMPLE 3.4 — Four hundred plants of a certain hybrid rye (3) produced their 
first flowers an average of 70.5 days after planting, the standard deviation being 6.9 
days. Estimate the standard deviation of means of similar samples. £.ns, xj » 0.34 day. 

EXAMPLE 3.5 — Calculate xj = 1.16 ounces in the following sample of birth 
weights of 12 pigs: 3, 5, 7, 8, 9, 10, 10, 11, 12, 13, 15, 17 ounces. 

EXAMPLE 3.6 — During the period from 28 to 70 days of age the mean gain in 
weight of 16 rats was reported as 1 38 grams with the standard error, 4 grams. Assuming 
normal distribution, estimate the range of gains in the sample. (Compare example 
2.16). Ans. From 110 to 166 grams. 

EXAMPLE 3.7 — If you were to draw a sample of 40 items from the pig gains of 
table 3.1, the standard error of the mean is an estimate of what population parameter? 
Ans. 1.58 pounds. You can try this conveniently by combining 4 of your samples of 10, 
taking them just as they came in your original drawing. 

EXAMPLE 3.8 — Provided you have 10 or more samples of 10, you can easily make 
a sample of 100. If you used the computational method of section 5.2 your 10 values 
of SX and SX 2 are quickly combined for calculation of Sx 2 in this sample of 100. x$, 
which may then be computed for this sample, is an estimate of what parameter? 
Ans. 1 pound. 

EXAMPLE 3.9 — From table 5.5 you may read the average range of samples of 
511 as about 6.08<r. Since the means of table 3.4 are drawn from a population in 
which cr ■= 10/\/10 3.16 pounds, calculate the average range expected in samples 

of 511 and compare the result with the actual range, 39 — 19 « 20 pounds. 

EXAMPLE 3.10 — It was reported that a lot of pigs gained at the rate of 1.4 pounds 
per day with standard error, 0.02 pound per day. The author failed to state the 
number of pigs in the lot. If you wish to get an idea of this number, you can use the 
relation, x* = s/\/n provided you know from experience that the coefficient of vari- 
ability of fattening swine is about 10%. This would lead to an estimate of the standard 
deviation as 10% of 1.4 « 0.14 pound per day. Then, \/n — s/si « 0.14/0 02 ■* 7; 
and finally, n «* 49 pigs. This value of the coefficient of variation, 10%, is fairly small 
for fattening swine, but may be reduced still further by efficient experimental methods. 


3.7 — Distribution of t. You became acquainted with this statistic in 
sections 2.12 and 2.13, but explanation had to be delayed until now. The 
formula is 

t = (x — m)/si 

In your sampling, m is known to be 30 pounds. Contrast this with section 
2.13 where the hypothesis was set up that m = 0, and where consequently 
t was equal to x/s iy a value completely determined by the sample. A 
second contrast is furnished by section 2.12 where the value of m was 
unknown. The two fiducial limits, l\ and k, were calculated by solving 
the equations, 

/.os = (x — h)/s£ and — /. 06 = (x — k)/st 

in which /. 0 s was a positive number taken from the table, and m was 
replaced first by /i, smaller than x, and then by h, larger. Clearly, we 
must know more about this useful quantity, /. 

Since x and s§ have already been calculated, the sample value of / 
may now be got by putting m — 30, the resulting formula being 

t = (x - 30) /s* 
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Here, t will be positive or negative according as x is greater or less than 
30 pounds. In the present sampling the two signs are equally likely, so 
you may expect about half of each. On account of this symmetry the 
mean of all your t y s should be near zero. 

The samples in table 3.3 were selected to illustrate the manner in 
which large, small and intermediate values of t arise in sampling. A small 
deviation, ( x — m), and/or a large standard error tend to make t sm&ll. 
Some striking combinations are put in the table, and you can doubtless 
find others among your samples. 

The distribution of the laboratory sample of s is displayed in table 3.7. 


TABLE 3.7 

Sample and Theoretical Distributions of t. Samples of 10. 
Degrees of Freedom, 9 


Interval of t 

Sample 

Theoretical 






| Cumulative 




Percentage 

Frequency 

Percentage 

Frequency 



From 

To 

Frequency 

One Tail 

Both Tails 


-3 250 

3 

0 6 

0 5 

100 0 


— 3 250 

~2 821 

4 

0 8 

0 5 

99 5 


— 2 821 

-2 262 

5 

1 0 

1 5 

99 0 


-2 262 

-1 833 

16 

3 1 

2 5 

97 5 


-1 833 

-1 383 

31 

6 1 

5 0 

95 0 


-1 383 

-1 100 

25 

4 9 

5 0 

90 0 


-1 100 

-0 703 

52 

10 2 

10 0 i 

85 0 


-0 703 

0 0 

132 

25 8 

25 0 

75 0 


0 0 

0 703 

126 

24 6 

25 0 

50 0 

100 0 

0 703 

1 100 

41 

8 0 

10 0 

25 0 

50 0 

1 100 

1 383 

32 

6 3 

5 0 

15 0 

30 0 

1 383 

1 833 

18 

3 5 

5 0 

10 0 

20 0 

1 833 

2 262 

13 

2 5 

2 5 

5 0 

10 0 

2 262 

2 821 

8 

1 6 

1 5 

2 5 

5 0 

2 821 

3 250 

2 

0 4 

0 5 

1 0 

2 0 

3 250 

; 

3 

0 6 

0 5 

0 5 

1 0 

: 

511 

100 0 

100 0 



The class intervals in the present table are, as in table 1 4, unequal, 
adjusted so as to bring into prominence certain useful probabilities in the 
tails of the distribution. The theoretical percentage frequencies are 
recorded for comparison with those of the sample. In the last two columns 
are the cumulative percentage frequencies which make the table con- 
venient for fiducial statements and tests of hypotheses. Examination of 
the table reveals that 2.5% of all lvalues in samples of 10 theoretically 
fall beyond 2.262, while another 2.5% are smaller than — 2.262. Combin- 
ing these two tails of the distribution, as shown in the last column, 5% 
of all V s in samples of 10 lie further from the center than 2.262, which is 
therefore called the 5% level of t. The 5% point , you will note, beyond 
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which 5% of each tail lies, is =*= 1.833. The utility of these and similar 
markers is to be explained. At the moment, make a distribution of your 
own sample fs to be compared with the theoretical distributions in the 
table. 

Cumulative percentage distributions of t for other sample sizes have 
been worked out and are presented in table 3.8. Because of symmetry, 
only the positive half of the /’ s are listed in the body of the table; that is, 
only those corresponding to the lower half of table 3.7. Along the top 
border may be read the cumulative percentages for the two tails combined, 
while on the left are degrees of freedom, n — 1. The boundaries of the 
class intervals in table 3.7 (0.703, 1.100, 1.383, etc.) may be found in the 
row specified by 9 degrees of freedom. 

3.8 — The fiducial statement. The theory of the fiducial interval may 
now be verified from your sampling. Each sample is the basis of a fiducial 
statement about the population mean. Put t = + 2.262, the 5% level 
for 9 degrees of freedom, in both of the equations given in the foregoing 
section — 

2.262 - (x - h)/s £ and -2.262 = (x - k)/s * 

Solving each, 

l\ = x — 2.262r* and / 2 = x + 2.262if 

Substituting x and sz from each sample, the two limits are calculated as 
illustrated in table 3.3. Finally, for any particular sample, if you say 
that the population mean lies between the limits /i and k you will be 
right or wrong; which it is may be determined readily because you know 
the parameter, 30 pounds. The theory will be verified if about 95% of 
your statements are right and about 5% wrong. 

Referring again to table 3.3, sample 1 warrants the statement that 
the population mean lies between U = 26.3 and k = 44.9 pounds, and 
we know in the present instance that this fiducial interval does contain m. 
On the contrary, samples 3 and 4 were selected to illustrate cases leading 
tip false statements, one because of an unusually divergent mean, the other 
because of a small standard deviation. Of the 511 laboratory samples, 

, 486 resulted in correct statements about m ; that is, 95.1% of the statements 
were true. The percentage of false statements, 4.9%, closely approxi- 
mated the theoretical 5%. Always bear in mind the condition involved 
in every fiducial statement at the 5% level — it is right unless a one-in- 
twenty chance has occurred in the sampling. 

Practical applications of this theory are by people doing experiments 
and other samplings without knowledge of the population parameters 
When they make fiducial statements they do not know if they are right 
or wrong — they know only the probability selected. It is possible though 
improbable that the first five samples you draw in your research career 
will lead to false statements, not because of unskillful manipulation but 



TABLE 3.8 
Values of t * 


Probability of a Larger Value of t , Sign Ignored 


D.f. 

0.5 

0.4 

0.3 

0.2 

0.1 1 

0 

.05 

0, 

.02 

0 

01 

D-f- 

1 

1.000 

1.376 

1 

.963 

3 

.078 

6.314 

12 

.706 

31 

821 

63 

657 

1 

2 

.816 

1.061 

1 

.386 

1 

.886 

2.920 

4 

.303 

6, 

.965 

9 

.925 | 

2 

3 

.765 

.978 

1 

250 

1 

.638 

2.353 

3 

.182 

4 

541 

5 

841 

3 

4 

.741 

.941 

1 

.190 

1 

533 

2 132 

2 

.776 

3. 

.747 ! 

4, 

.604 ! 

4 

5 

.727 

.920 

1 

.156 

1 

.476 

2 015 

2 

.571 

3. 

.365 

4 

.032 

5 

6 

.718 

.906 

1 

.134 

1 

.440 

1.943 

2 

447 

3. 

.143 

3 

707 

6 

7 

.711 

.896 

1 

.119 

1 

.415 

1 895 

2 

365 

2 

998 

3 

499 

7 

8 

.706 

.889 

1 

.108 

1 

.397 

1.860 

2 

.306 

2 

896 

3 

355 

8 

9 

.703 

883 

1 

100 

1 

.383 

1 833 

2 

.262 

2, 

.821 

3 

250 

9 

10 

.700 

.879 

1 

093 

1 

372 

1.812 

2 

228 

2, 

.764 

3 

169 

10 

11 

.697 

.876 

1 

.088 

1 

363 

1.796 

2 

.201 

2 

718 

3 

.106 

11 

12 

.695 

.873 

1 

083 

1 

356 

1.782 

2 

.179 

2 

681 

3 

.055 

12 

13 

694 

.870 

1 

.079 

1 

350 

1.771 

2 

.160 

2 

650 

3 

012 

13 

14 

692 

.868 

1 

076 

1 

.345 

1.761 

2 

145 

2, 

,624 

2 

.977 

14 

15 

.691 

866 

1 

074 

1 

341 

1 753 

2 

.131 

2, 

.602 

2 

947 

15 

16 

.690 

.865 

1 

071 

1 

337 

1.746 

2 

120 

2 

583 

2 

921 

16 

17 

. 689 

863 

1 

069 

1 

333 

1 740 

2 

110 

2 

567 

2 

898 

17 

18 

688 

.862 

1 

.067 

1 

330 

1 734 

2 

101 

2 

552 

2 

878 

18 

19 

.688 

.861 

1 

066 

1 

328 

1 729 

2 

.093 

2. 

,539 

2 

861 

19 

20 

687 

.860 

1 

064 

1 

325 

1 725 

2 

.086 

2 

.528 

2 

845 

20 

21 

686 

.859 

1 

063 

1 

323 

1 721 

2 

.080 

2 

518 

2 

831 

21 

22 

.686 

858 

1 

061 

1 

.321 

1.717 

2 

074 

2 

508 

2 

819 

22 

23 

685 

.858 

1 

.060 

1 

319 

1 714 

2 

069 

2 

500 

2 

807 

23 

24 

685 

.857 

1 

059 

1 

.318 

1 711 

2 

064 

2 

492 

2 

797 

24 

25 

684 

856 

1 

058 

1 

.316 

1.708 

2, 

.060 

2 

485 

2. 

.787 

25 

26 

.684 

.856 

1 

058 

1 

315 

1.706 

2 

056 

2 

479 i 

2 

779 

26 

27 

684 

855 1 

1 

057 | 

1 

314 

1 703 | 

2 

052 

2. 

473 i 

2 

771 

27 

28 

683 

855 

1 

.056 

1 

313 

1 701 

2 

048 

2. 

,467 

2 

763 

28 

29 

.683 

854 

1 

055 

1 

.311 

1.699 

2 

045 

2. 

462 

2 

756 

29 

30 

.683 

854 

1 

055 

1 

310 

1 697 

2 

042 

2 

457 

2 

750 

30 

35 








2 

030 



2 

724 

35 

40 








2 

021 



2 

704 

40 

45 








2 

014 



2 

690 

45 

50 








2 

008 



2 

678 

50 

60 








2 

000 



2 

660 

60 

70 







: 

1 

994 



2 

648 

70 

80 








1 

990 



2 

638 

80 

90 








1 . 

987 



2 

632 

90 

100 








1 

984 



2 

626 

100 

125 








1 . 

.979 



2. 

616 

125 

150 








1 

976 



2 

609 

150 

200 








1 . 

972 



2 

601 

200 

300 








1 

968 



2 

592 

300 

400 








1 

966 



2 

588 

400 

500 








3 

965 



2 

586 

500 

1000 








1 

962 



2 

581 

1000 

00 

.67449 

.84162 

1 . 

03643 

1 

28155 

1 64485 

1.95996 

2. 

32634 

2 

57582 

00 


* Partly from table IV of “Statistical Methods for Research Workers” with per- 1 
mission of the author, R. A. Fisher, and of his publishers, Oliver and Boyd, Edinburgh. 
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through an unlucky quirk of chance. Let us hope that the wise counsel 
of a more experienced leader would guide you around such a sampling 
pitfall. 

I trust you now have a clear view of the nature of the fiducial inference, 
and that you will make use of it with a happy combination of caution and 
confidence. In the following sections we shall study the third inference 
involving certain hypotheses about populations. 

EXAMPLE 3.11 — Using the sample frequencies of table 3.7, test the hypothesis 
(known to be true) that the /-distribution is symmetrical in the sense that half the popu- 
lation frequency is greater than zero. Ans. x 2 “ 11 3. 

EXAMPLE 3.12— From table 3.7 it is seen that 3 + 44 - 54 - 8 + 2 + 3 * 25 
samples have |/| > 2.262. Test the hypothesis that 5% of the population values are 
greater than 2.262. Ans. x 2 * 0.0001. 

EXAMPLE 3.13 — In table 3.7, accumulate the sample frequencies in both tails 
and compare their percentage values with those in the last column of the table. 

EXAMPLE 3.14 — During the fall of 1943, approximately one in each 1,000 city 
families of Iowa (cities are defined as having 2,500 inhabitants or more) was visited to 
learn the number of quarts of food canned. The average for 300 families was 165 
quarts with standard deviation, 153 quarts. Calculate the 95% fiducial limits. Ans. 
165 17 quarts. 

EXAMPLE 3.15 — The 1940 census reported 312,000 dwelling units (roughly the 
same as families) in Iowa cities. From the statistics of the foregoing example, estimate 
the number of quarts of food canned in Iowa cities in 1943. Ans. 51,500,000 quarts 
with 95% fiducial limits, 46,200.000 and 56,800,000 quarts. 

EXAMPLE 3.16 — Compute the 99% fiducial limits of the mean in example 3.14 
Ans. 165 =*= 23 quarts. 

3.9 — Differences. Test of null hypotheses. Beginning with section 
1.6 and again with section 2.13 you learned how to test useful hypotheses 
pertaining to sampled populations. It is time to examine the latter 
section more in detail, using the familiar technique of drawing samples 
from the hypothetical population. In practice, the hypothesis most often 
set up is that there is no difference between the two populations sampled; 
that is, that the contrasted members of the pairs are drawn from the same 
source. A convenient model is a normally distributed population of 
differences with mean equal to zero. We can easily sample such a popu- 
lation by taking advantage of this theorem: if items are drawn at random 
from a normal population, then randomly paired, the differences between 
pairs are normally distributed with zero as mean. Merely copy down 
your samples in pairs, as in table 3.9, subtract each item in the second 
from the corresponding item in the first, and use the differences as the 
new samples. Naturally, the samples must be taken just as they came from 
the original drawing, without selection as to which shall be first and which 
second, and there must be no rearrangement of the items. Two of the 
pairs in the table were selected from our 511 samples to illustrate features 
to be commented on below. The third is a made-up sample illustrating 
a theorem that will be needed later. 

These two facts will be noticed right away in the sampling: (i) the 
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TABLE 3.9 

Two Samples of Differences Drawn at Random From Table 3.1, 
and a Made-Up Sample * 
m ■* mi — mt *= 0. <r\ 2 *■ (n* ** 100. <r* *= 200 


1. Random 

2. Random 

3. Selected 

Paired Items 

Differences 

Paired Items 

Differences 

Paired Items 

Differences 

X l 

Xt 

X 


x t 

X 

Xi 

Xt 

X 

39 

17 

22 

19 

33 

-14 

32 

22 

10 

34 

29 

5 

14 

30 

-16 j 

31 

17 

14 

22 

30 

- 8 

57 

41 

16 

28 

34 

- 6 

27 

36 

- 9 

34 

42 

- 8 

24 

24 

0 

33 

41 

- 8 

39 

33 

6 

44 

12 

32 

42 

30 

12 

34 

21 

13 

53 

29 

24 

36 

3 

33 

39 

36 

3 

9 

23 

-14 

24 

23 

1 

13 

33 

-20 

35 

37 

- 2 

25 

38 

-13 

39 

33 

6 

33 

21 

12 

29 

30 

- 1 

23 

22 

1 

31 

11 

20 

£ 


3 4 



- 1 3 



9 

s* 


222 9 



155 6 



207.4 

S£* 


22 29 



15 56 



20.74 

Si 


4 72 



3.94 



4.55 

t 


0 72 



- 0.33 



1 98 


differences, both in size and number, are almost equally divided between 
pluses and minuses, resulting in means falling with equal likelihood above 
and below zero; and (ii) the variation of the differences is much greater 
than that of the original population. The reason for this second fact is 
apparent when you observe that the possible range of differences is from 
3 — 57 = —54 pounds to 57 — 3 = 54 pounds, a range of 54 — ( — 54) 
= 108 pounds, twice the range of the gains in table 3.1. You will not be 
surprised, then, by these theorems: (i) the variance of random differences, 
<r 2 , is double that of the original population; and (ii) the variance of each 
sample of differences, s 2 , is an unbiased estimate of 2cri 2 , (= 2 <r 2 2 ). So if you 
average the variances of all your samples of differences, the result should 
be not far from 200, twice the variance of the pig gains. I had 144 such 
variances with the mean, 207.9. You are doubtless impressed, perhaps 
startled, by the variability of s 2 in your samples. 

You are now prepared for another theorem: not only the differences 
but also the means of the differences are normally distributed about m = 0, 
the variance of the means being cr* 2 = 2<n 2 /n . Again, the variance of each 
sample mean, j* 2 , is an unbiased estimate of a* 2 . In verification, I found 
the average of my 144 mean differences to be 0.51 pound with variance, 
22.2, instead of the expected m = 0 and <r± 2 = 20. 

This variance, double that in the original population, may be dis- 
concerting to the many who expect to set up experiments for evaluating 
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differences. Being confronted by such great variation, how can they hope 
to get means with fiducial intervals small enough to be of practical use? 
Obviously, larger samples would do it, but such may not be feasible. 
Some devices must be found for reducing the error encountered, or else 
random pairing must be avoided whenever possible. Both alternatives 
are to be considered in the next section. At the moment we must look at 
the values of t in our samples of differences. Since m is now zero, the 
formula is 

t ^ % f 

as in section 2.13. The calculation is illustrated in table 3.9. Here is an 
interesting new feature. Since the differences have greater variability 
than the original gains, both x and s& have wider ranges; but the ratio, t, 
is distributed exactly as in table 3.7, a fact substantiated by the distribu- 
tion of table 3.10. Clearly, these 144 /’ s might be added to the previous 


TABLE 3.10 

Distribution of 1 44 Vai ues of t in Samples of Differences 
Randomly Drawn From Table 3 1 ; n = 10 


Class Interval* 

1 requency 

Percentage 

I requenev 

Theoretical 

Percentage 

Frequencv 


-2 821 

0 

0 00 

1 

-2 821 

-1 833 

6 

4 17 

4 

-1 833 

-1 100 

11 

7 64 

10 

-1 100 

0 000 

48 

33 33 

35 

0 000 

1 100 

55 

38 20 

35 

1 100 

1 833 

13 

9 03 

10 

1 833 

2 821 

10 

6 94 

4 

2 821 


! l 

0 69 

1 


144 

100 00 

100 


* Because this sample is small, each pair of intervals in table 3 7 is here combined 
into one. 


511, making a sample of 655 values all randomly and independently 
drawn from a single population of t. This justifies the use of the same table 
of t for samples of gains and for samples of differences. For any one size 
of sample the distribution of t is contingent only upon normality in the 
population sampled, irrespective of m or cr. This is one reason for the 
utility of the statistic. 

In your samples of differences, did you get any values of t larger (in 
absolute value) than the 5% level, 2.262? Among my 144 samples I got 
10 instead of the expected 7. One was greater than 3.250, the 1% level. 
You are well aware by now that if these were experimental results, the 
null hypothesis might have been rejected in these 10 samples though it 
was, in fact, true. 
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EXAMPLE 3.17 — Collins, Flint, and McLane (2) investigated the effect of a small 
electric current on the growth of corn seedlings. The seedlings were grown in pairs of 
boxes, one of each pair treated, the other not. The differences in elongation, treated- 
untreated, were as follows: 6.0, 1.3, 10.2, 23.9, 3.1, 6.8, —1.5, —14.7, —3.3, 11.1 mm. 
Calculate t «* 1.33. 

EXAMPLE 3.18 — An agronomist interested in the effect of superphosphate on 
corn yield tried adding the fertilizer to a treatment of manure and lime. Five pairs 
of plots were tested. The plots with superphosphate yielded 20, 6, 4, 3, and 2 bushels 
per acre more than their parallels. Was the value of superphosphate demonstrated? 

EXAMPLE 3.19 — The foregoing experiment was tried again. The second time, 
the plots receiving superphosphate yielded 5, 6, 4, 3, and 2 bushels per acre more 
than their parallels. Was this better evidence of the value of superphosphate than the 
former? If you think merely of the samples, the answer seems silly. If you fix your 
attention on the populations, the results are reasonable. The sample estimate indi- 
cates that the first population is highly variable, so that small samples drawn from it 
are unreliable. 

EXAMPLE 3.20— The following are the yields in bushels per acre of 22 corn 
crops from Iowa farms on Carrington loam. The data were collected by Boatman 
and Baker in 1918-1924. The farms are in the locality of the towns indicated. Is the 
mean difference significant? Do you suppose the lime directly affected the yield of 
corn? 


Locality 

Eldora 

Gruver 

i 

Jesup 

Scran- 

ton 

Year 

1919 1920 

1923 1924 , 

1921 1922 

1924 

Fertilized with manure only 

50 62 

38 60 

73 66 

43 

Fertilized with manure and lime 

52 66 

40 64 | 

78 71 

46 





- _ - 


— - — = 


Truesdale, 2 farms 

Indepen- 

dence 

May- 

nard 

Waverly, 3 farms 

1918 1919 1920 1921 

1922 

1924 

1921 1924 

1923 

1918 

1920 

1922 1922 

1923 

1923 

44 

57 57 40 

62 

24 

64 58 

54 

54 

89 

58 58 

40 

25 

46 

58 59 42 

64 

32 

69 59 

60 

57 

88 

60 62 

57 

30 


EXAMPLE 3.21 — It is reported that, in a scries of 15 variety trials, the difference 
in plot yields, A — B, ranged from —6 to 16 bushels per acre. Can you estimate 
t = 3.2 for testing the significance of the mean difference between A and B ? 

EXAMPLE 3.22 — In reporting the results of his research, an investigator stated 
the mean difference and standard error of his differences but omitted the sample size. 
He did say, however, that his f-value was greater than the 1 % level, 2.861. Would you 
deduce the size of his sample as 20 differences? 

EXAMPLE 3.23 — Two lots of 10 rats each, paired as usual, were fed rations pro- 
ducing a mean difference of 4.4 grams in weight, with standard deviation 9.4 grams. 
Since the difference is not significant, the question is raised as to whether more rats 
might demonstrate a significant superiority of one of the rations. Of course, the 
population gains corresponding to the two rations may not be different. Assuming, 
however, that they may be, and assuming further that a trial with more rats will pro- 
duce a mean difference of not less than 4.4 grams, and a standard deviation of not 
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more than 9.4 grams, find the number of pairs necessary to make t ** 2 (section 2.14). 
Ana. About 19 pairs. 

EXAMPLE 3.24 — To get a clear idea of the 5% values of / in table 3.8, make a 
graph with degrees of freedom along the horizontal axis and / plotted against them 
vertically. The curve through such points will fall away rapidly at first, then more 
slowly, approaching the value, / ** 1.96, as a limit. If carefully drawn, this graph may 
be used for interpolating /-values between those given in the table. 

EXAMPLE 3.25 — For an interesting account of some of the early testing of experi- 
mental data from paired individuals, read Fisher’s Design of Experiments (4), chapter 3. 
It is there shown that even Darwin and Galton were uncertain of the principles govern- 
ing this test of significance. 

All the facts necessary for reading the earlier sections of chapter 4 have been presented . For a 
brief first course , therefore , the remainder of this chapter may be omitted. 

3.10 — Reducing the error involved in sampling. Having observed 
the large variation in random differences, we felt that something must be 
done to avoid such inaccuracy in experimentation. Extending the sam- 
pling to great numbers of pairs would decrease to any specified value, 
but this process would be expensive. Furthermore, since practical sam- 
plings are often from populations restricted in size, any great expansion of 
n may be impossible. Therefore, samplers have sought other methods for 
increasing accuracy. 

Three additional ways of reducing sampling errors are in common use: 
(i) selecting or developing populations in which variation is known to be 
small, (ii) subdividing the aggregates into sub-populations or strata of 
similar individuals, and (iii) utilizing information contained in related 
variates. These methods may be used in all combinations. Some details 
follow immediately, but most of them are distributed throughout the 
remainder of the book. 

It is sometimes practicable to get information by sampling populations 
whose variation is small. For example, one might decide to restrict his 
inquiry about radio program listeners to women in a certain city, question- 
ing only those who are housekeepers with one or more children and 
whose family income is between, say, $2,000 and $3,000. The variable 
being measured, such as amount spent for toilet soap, might have little 
range within this restricted population. Clearly, no specific information 
would be got about other parts of the aggregate, but the information 
about the population sampled should be relatively accurate. Again, 
for certain problems in nutrition of rats it seems adequate to sample a 
highly inbred colony whose range in 28-day weight, for example, is 
half that resulting from random mating. Standard errors for means of n 
such animals would be half those in the randomly mated population; 
consequently, fiducial intervals would be decreased by the same fraction 
while sample values of t would be doubled. 

The facts just stated can be verified by sampling from the normal 
population represented by table 3.11. This population has a = 5 pounds 
instead of the 10 pounds of table 3.1, the range also being half the previous 
value. Since the facts seem obvious, it will scarcely pay you to draw a 
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TABLE 3.11 

Array of 100 Gains Having Half the Range and Standard Deviation as That 
in Table 3.1. Distribution Is Approximately Normal With m « 30 Pounds and 

3 5 Pounds 


Item No. 

Gain 

Item No. 

Gain 

Item No. 

Gain 

Item No. 

Gain 

00 

17 

25 

27 

50 

30 

75 

33 

01 

19 

26 

27 

51 

30 

76 

34 

02 

20 

27 

27 

52 

30 

77 

34 

03 

21 

28 

27 

53 

30 

78 

34 

04 

22 

1 29 

27 

54 

31 

79 

34 

05 

22 

30 

27 

55 

31 

80 

34 

06 

22 

31 

28 

56 

31 

81 

34 

07 

23 

32 

28 

57 

31 

82 

35 

08 

23 

33 

28 

58 

31 

83 

35 

09 

23 

34 

28 

59 

31 

84 

35 

10 

23 

35 

28 

60 

31 

85 

35 

11 

24 

36 

28 

61 

31 

86 

35 

12 

24 

37 

29 

62 

32 

87 

36 

13 

24 

38 

29 

63 

32 

88 

36 

14 

25 

39 

29 

64 

32 

89 

36 

15 

25 

40 

29 

65 

32 

90 

36 

16 

25 

41 

29 

66 

32 

91 

37 

17 

25 

42 

29 

67 

32 

92 

37 

18 

26 

43 

29 

68 

33 

93 

37 

19 

26 

44 

29 

69 

33 

94 

38 

20 

26 

45 

30 

70 

33 

95 

38 

21 

26 

46 

30 

71 

33 

96 

39 

22 

26 

47 

30 

72 

33 

97 

40 

23 

26 

48 

30 

73 

33 

98 

41 

24 

27 

49 

30 

74 

33 

99 

44 


lot of samples from this table — its real use will come later. It is probably 
sufficient to look at the distribution of 400 means of 10 items drawn in 
the laboratory, table 3.12. The range is seen to be about half that in 
table 3.4. The average of the means in table 3.12 is 30.0 pounds, while 
their standard deviation is 1.57 pounds instead of the expected 

o'* = <r/y/n — S/\/\0 = 1.58 pounds 

Obviously, this scheme of reducing sampling variation by sampling from 
small-ranged populations is effective. Unfortunately, there are not many 
circumstances in which the method is available. Usually the information 
desired is about existent populations over whose variation the sampler 


TABLE 3.12 

Frequency Distribution of Means of Samples of 10 Items Drawn at Random 
From the Normal Distribution in Table 3.11 


Class mark (pounds) 

26 

27 

28 

29 

30 

31 

32 

33 

34 

Frequency 

5 

20 

43 

80 

99 

85 

48 

18 

2 
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has no control, and the less variable population may not be representative 
of the other. 

The second method of reducing error variation, by sub-division of the 
aggregate into relatively homogeneous strata, has been found useful in a 
surprising variety of situations. Most sampling and experimental designs 
employ it either exclusively or in connection with other devices. The 
reader will be reminded of it many times as he pursues this text, and later 
will be referred to more specialized discussions. An instance of this second 
method is the sampling from pairs of half leaves described in section 2.13, 
where the concentration of a virus was being examined. The leaf areas 
to be inoculated with the two preparations were chosen adjacent because 
such had been found to react more nearly alike than areas on different 
leaves. Therefore, if one half of each leaf is rubbed with preparation 
number 1 and the other half with number 2, the lesion differences are 
expected to vary less than they would if the two preparations were applied 
to half leaves selected by chance from different plants. 

Was this design successful in the tobacco virus experiment? For an 
answer, compare the experimental variance, 18.57 as calculated in 
table 2.2, with an estimate of what it would have been in a completely 
random arrangement. While the correct way of making this estimate is 
more conveniently presented in section 11.5, a good approximation is got 
by calculating the lesion variance on the 8 half-leaves inoculated with the 
first preparation, then on the 8 with the second, and finally averaging the 
two. Clearly, the variation due to treatment is not included in this aver- 
age, but only that arising from the differences among half-leaves treated 
alike. The two variances, one computed from each of the two columns of 
half-leaf lesions in table 2.2, are 66.86 and 24.57, the average being 45.71. 
To compare this with the actual experimental variance, one must re- 
member that 18.57, the variance of differences, is an estimate of 2a 2 . 
The proper comparison, then, is 18.57/2 = 9.28 with 45.71. Thus, the 
variance involved in evaluating this experiment is only about 20% of 
that which would have been encountered with random selection of half- 
leaves. This means that, for equal accuracy, five times the number of 
plants would be required if the pairing of contiguous half-leaves were not 
practiced. Hence, it is said that the efficiency of this experiment was five 
times as great as it would have been with random choice. Another way of 
saying the same thing is that the information per plant was five times that 
in the nonstratified experiment. (In chapter 11 you will find that the 
unbiased estimate of this factor is 4.66 instead of 5). 

As will be emphasized later, the efficacy of this method of stratification 
depends on foreknowledge of the behavior of experimental material, and 
this can be gained only by experience. No advantage is to be expected 
unless the members of a pair tend to perform in a similar fashion when 
treated alike. If these members differ as much as the pairs differ, then 
the stratification is no better than random pairing. 
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Does stratification imply a desertion of the fundamental principle of 
randomness? Not at all. It is merely a device for utilizing any knowledge 
one may have of outcome in order to exclude from the estimate of error 
that variation which he can anticipate. In the virus experiment two 
sources of variation had been learned from experience: (i) the differ- 
ences from plant to plant, and (ii) the differences among leaves on the 
same plant. Such variability was avoided in estimating experimental 
error by trying the two preparations on halves of the same leaf. Random- 
ness was attained by allotment of the two preparations to the right and 
left halves by some such mechanical device as the toss of a coin or a table 
of random digits. The variation entering the estimate of error was only 
that due to random differences in the reactions of two areas of the same 
leaf. 

The third method of controlling error variation, by use of additional 
information from the measurement of related variates, will be presented 
in chapter 6. 


EXAMPLE 3.26 — In example 2.27 were given the biological values of raw and 
roasted peanuts as determined in 10 pairs of rats The standard deviation of the 
differences was 7.41, so that the experimental variance per rat is estimated as (7.41) 2 /2 
= 27.5. Calculate the average variance among the rats in the two groups. Ans. 26. 
It appears then that, so far as biological values are concerned, the pairing was less 
efficient than random assortment. 


EXAMPLE 3.27 — If the ninth pair, 44, 62, had not turned up in example 2.27, 
computation will show that the paired sample would have been judged 9 times as 
efficient as the unpaired. Since such radical changes in attitude depend on one pair 
of rats, the investigator learns to be cautious 


3.11 — Summary. By experience you have verified a number of useful 
theorems about sampling from normally distributed populations of 
measurement variates. Means of samples of n are normally distributed 
with mean, m, but with standard deviation, <r/\/n. Each random sample 
yields unbiased estimates of m , a 2 and <j-}. 

The sample, together with some hypothesis about m , yields the statistic, 
/ = (!*“ w)/si, symmetrically distributed about the mean, zero. With a 
table of the distributions of /, one may make fiducial inferences about the 
mean of the population from which the sample is drawn. 

Random differences from a single population with variance, ai 2 , are nor- 
mally distributed about zero mean with variance, <r 2 = 2<ri 2 , means of 
such differences being similarly distributed with <Tx 2 = 2cri 2 /n. The sample 
value of t — x/s £ furnishes a test of the hypothesis that m = 0. 

There are three methods of avoiding the variability of random differ- 
ences: (i) selecting less variable populations, (ii) subdividing the popula- 
tion into strata known to be less variable, and (iii) measuring a related 
variate. 
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Chapter 4 

THE COMPARISON OF TWO GROUPS 


4.1 — Introduction. Proposed in the foregoing chapter were three 
devices for decreasing or avoiding the disagreeable variation that may be 
found in random differences. If no one of these schemes is feasible it 
may be wise, in making comparisons, to refrain from the pairing of indi- 
viduals, merely assigning them at random to two groups (lots) and apply- 
ing one treatment to each group en masse. It will then be only the group 
means that are compared, not the individual measurements. The varia- 
tion among the individual measurements, however, should be the basis 
for an estimate of experimental error. An example will illustrate this type 
of comparison. 

4.2 — An experiment comparing two groups having equal numbers 
of individuals. Using male chicks, Breneman (2) contrasted the 15-day 
mean comb weight of two lots, one receiving sex hormone A (testosterone) 
and the other C (dehydroandrosterone). Day-old chicks, eleven in num- 
ber, were assigned at random to each of the treatments. The two lots, 
caged together, were distinguished by red and purple head-stains. The 
individual comb weights are recorded in table 4.1. 


TABLE 4.1 

Weight of Combs* (15th Day) of Male Chicks Given Hormones A and C 


Hormone 

Number 
of Chicks 

Weight of Comb (milligrams) 

Mean 

A 

11 

57 120 101 137 119 117 104 73 53 68 118 

97 

C 

11 

89 30 82 50 39 22 57 32 96 31 88 

56 


* Slight alterations of the original data have been made to simplify computations. 


Since there was no basis for pairing the birds, no individual differ- 
ences are computed. It is the difference of two means, x = 97 — 56 = 41 
mg., that is an unbiased estimate of the difference between the means in 
two populations of comb weights characterized by the hormones ad- 
ministered. 

To arrive at an estimate of the variation of £, one naturally turns 
to the sampling experiments of the previous chapter. He may use the 
means of samples drawn from the population of pig gains, table 3.1. If 
these means are paired at random, as in table 4.2, the differences will vary 

[ 75 ] 
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TABLE 4.2 

Randomly Assorted Means of Samples of 10 Drawn From the Distribution of 
Table 3.1, Together With Differences 
Pounds 


Xi 

Means 

x, 

Difference 

£ 

Means 

Xi 

U 

Difference 

X 

26 

25 

1 

31 

30 

1 

28 

26 

2 

33 

29 

4 

32 

37 

-5 

33 

24 

9 

31 

30 

1 

37 

32 

5 

31 

35 

-4 

26 

31 

-5 

29 

36 

-7 

34 

27 

7 

35 

29 

6 

24 

29 

— 5 

32 

34 

-2 

29 

33 

-4 

29 

33 

-4 

26 

30 

-4 

33 

28 

5 

38 

28 

10 

30 

33 

— 3 

24 

32 

— 8 

33 

27 

6 

30 

31 

*-1 

32 

29 

3 

31 

27 

4 

22 

25 

-3 

32 

26 

6 

21 

30 

— 9 

35 

28 

7 

29 

26 

3 

30 

28 

2 

28 

34 

-6 

31 

33 

-2 

31 

36 

-5 

26 

33 

-7 

29 

30 

-1 

29 

22 

7 

28 

30 

-2 

27 

28 

-1 

29 

32 

-3 

32 

30 

2 

30 

34 

— 4 

29 

28 

1 

26 

30 

-4 

26 

27 

-1 

28 

27 

1 

25 

27 

-2 

30 

33 

— 3 

33 

29 

4 


Variance of differences, 22.42. Standard deviation of differences, 4.7 pounds. 


by an amount that will indicate the variance desired. Computation in 
the usual fashion results in s£ = 22.42, somewhat more than twice the 
variance of means which was encountered in the sampling of section 3.6, 
and almost the same as the variance of mean differences found in section 
3.9. Indeed, the theory is the same whether the mean of differences or the 
difference of means is examined. In both processes we are estimating 
2cr 2 / n = 20. This is the clue we need to solve the problem of group 
comparisons. If we estimate the variance, <r 2 , of the sampled population, 
we can then estimate the variance of the differences between sample 
means. As always in actual sampling these estimates are furnished by 
the sample statistics. 

Turning then, to table 4.3, two features may be noted: (i) each sample 
furnishes an estimate of variance, and (ii) the two estimates are combined 
or pooled by adding the two sums of squares, then dividing by the total 
degrees of freedom. Use of this pooled variance with dj. = 2{n — 1) 
enables us to take advantage of the /-distribution, because t calculated as 
in table 4.3 is distributed in the same manner as heretofore (table 3.8). 
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TABLE 4.3 

Summary of Statistics for Comparison of Two Groups of Equal Size. 
Chick Data of Table 4.1 



Number 

Degrees 



Hor- 

of 

of 

Mean Comb Weight 


mone 

Chicks 

Freedom 

(milligrams) 

i 

Sum of Squares 

A 

11 

10 

97 

8,472 

C 

11 

10 

56 

7,748 

Sum = 20 

Difference = x = 41 

Sum - *S* 2 ~ 16,220 


Pooled variance = j 2 = 16,220/20 = 811 

= V2 j*/« * VmWTl - v/147.5 « 12.14 mgm. 
t = 41/12.14 = 3.38. P < 1% 


The remaining calculations follow familiar courses. The variance of 
means is estimated by s 2 /n = 811/11, while that of differences between 
means is double this, or s± 2 = 2s 2 /n = 147.5. 

We are now ready for a test of significance. This experiment was set 
up paitly to learn if the hormones differentiated the chicks into two 
separate populations with contrasting means. To get evidence on this, 
we make the hypothesis that the populations have the same m y then submit 
this hypothesis to test in the usual manner. We are asking if the sample 
difference, x = 41 mg., may be no more than sampling variation from 
the hypothetical difference, zero. We have x = 41 mg., m = 0, s± 
= 12.14 mg., and 

t = 41/12.14 - 3.38 


For 20 degrees of freedom, table 3.8 indicates small chance of a larger 
value of t if sampling is really from the hypothetical population. The 
evidence is convincing that these hormones differentiate comb weights 
into two populations with unequal means. 

After the reader has mastered the argument leading to table 4.3 he 
may find it convenient to summarize the calculations by use of this 
formula: 



n{n — 1) 
6* 2 


in which x designates the difference between the two group means and 
Sx 2 is the pooled sum of squares. Substituting the chick data, x == 97 — 56 
= 41 mg., n = 11, and Sx 2 = 8,472 + 7,748 = 16,220. Then 


/ = 41 


4 


(liXio) 

16,220' 


3.38, as before. 
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Having reached the conclusion that there are two population means 
instead of one, it is informative to set fiducial limits to their difference, of 
which 41 mg. is an unbiased estimate. Since f.os = 2.086 for dj. = 20, 
the 95% limits are 

41 - (2.086) (1 2. 1) and 41 + (2.086)(12.1), 

that is, 16 and 66 mg. If the experimenter states that the difference 
between the means of the two populations is within these limits he will be 
right unless a one-in-twenty chance has come off. 

EXAMPLE 4.1 — From the normally distributed pig gains of table 3.1 1, two samples 
of 10 were drawn at random. With minor alterations to make calculation easier, the 
gains in weight (pounds) were: 

Sample 1: 35, 27, 31, 36, 40, 32, 29, 27, 30, 33 
Sample 2: 24, 29, 26, 31, 32, 28, 26, 35, 32, 27 

The mean difference is 3 pounds with pooled variance, 14.44. Calculate t ® 1.76. 
Note that the variance, 14.44, is an estimate of the population variance, 25. 

EXAMPLE 4.2 — In the foregoing example, the pooled sum of squares was 260. 
Use the formula, t = x\/'n(n — 1 )/Sx\ to calculate t. 

EXAMPLE 4.3 — Lots of ten bees were fed two concentrations of syrup, 20% and 
65%, at a feeder half a mile from the hive (7). Upon arrival at the hive their honey 
sacs were removed and the concentration of the fluid measured. In every case there 
was a decrease from the feeder concentration. The decreases were: from the 20% 
syrup, 0.7, 0.5, 0.4, 0.7, 0.5, 0.4, 0.7, 0 4, 0 2, and 0.5; from the 65% syrup, 1.7, 2.8, 
2.2, 1.4, 1.3, 2.1, 0.8, 3.4, 1.9, and 1.4%. Show that t » 5.6 if m » 0. There is little 
doubt that, under the experimental conditions imposed, the concentration during 
flight decreases more with the 65% syrup. See section 4.6 for further discussion. 

EXAMPLE 4.4 — In a trial of the effect of supplementary foodstuffs on the iodine 
number of back fat, four pigs were placed in each of two lots. In the lot fed tankage 
the iodine numbers were 65.5, 65.3, 67.1, and 73.7; while in the lot where 5% of soy- 
beans was substituted for the corresponding amount of corn, they were 71.5, 76.4, 
72.0, and 72.1. Show that the difference between means, 5.1, with si =« 2.28, is not 
quite significant. Ans. I » 2 24. 

EXAMPLE 4.5 — Four determinations of the pH of Shelby loam were made with 
each of two types of glass electrode (5). With a modified quinhydrone electrode, the 
readings were 5.78, 5.74, 5.84 and 5.80, while with a modified Ag/ AgCl electrode, 
they were 5.82, 5.87, 5.96, and 5.89. With the hypothesis that m ■* 0, calculate t = 2.66. 

EXAMPLE 4.6 — Fifteen kernels of mature Iodent corn were tested for crushing 
resistance. Measured in pounds the resistances were: 50, 36, 34, 45, 56, 42, 53, 25, 
65, 33, 40, 42, 39, 43, 42. Another batch of 15 kernels was tested after being har- 
vested in the dough stage: 43, 44, 51, 40, 29, 49, 39, 59, 43, 48, 67, 44, 46, 54, 64. Test 
the significance of the difference between the two means. Ans. t ** 1.38. 

EXAMPLE 4.7 — When yearling steers are shipped in from the range for fattening, 
so little is known of their previous history that estimating outcome is an uncertain 
process. They may be just as well placed in lots at random. Culbertson and Hammond 
(4) report the average daily gains of two lots of steer calves, one (I) having a supple- 
ment of linseed oil meal, and a second (V) receiving a light allowance (1 pound per 
steer daily) of whole soybeans. The rates of gain were: (I) 1.95, 2.17, 2.06, 2.11, 2.24, 
2.52, 2 04, 1.95: (V) 1.82, 1.85, 1.87, 1.74, 2.04, 1.78, 1.76, 1.86 pounds per day. 
Calculate the difference of the means, 0.29 pound per day, s ■■ 0.148 pound per day, 
and t « 3.92. 
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EXAMPLE 4.8 — In reading reports of researches it is sometimes desirable to supply 
a test of significance which was not considered necessary by the author. As an example, 
Smith (10) gave the mean yields and standard errors of two crosses of maize as 8.84 
at 0.39 and 7.00 * 0.18 grams. Each mean was the average of five replications. We 
wish to determine if the mean difference is significant. It is necessary to retrace the 
original computations till each sum of squares is reached. Starting with s$ *» 0.39, 
$£ 2 ** 0.1521, s 2 ** 0.7605, and Sx 2 = 3.042. Similarly, in the second cross, Sx* = 0.648. 
Pooling these two sums of squares with d.f. «* 8, the value of t may be calculated. 
Ans. t = 4.29. 

EXAMPLE 4.9 — Derive the formula, t = Sy/n(n — 1 ) /Sx 2 , starting with the 
rules of calculation given in connection with table 4.3. 

4.3— Difficulties met in experiments with groups. The foregoing 
test of the null hypothesis, m = 0, involves the subsidiary hypothesis 
that the comb weights in the two populations are normally distributed 
with common variance. 

As usual, the accuracy of the probabilities is little affected by even 
considerable departures from normality. The matter of population vari- 
ance will be discussed in section 4.6. More immediate is the question as to 
whether the estimate of variance, s\ correctly evaluates the variation, 
aside from treatment effects, to w hich the experimental means are subject. 
In the present instance this is aimed at by choosing chicks of one breed 
and se£, all of the same age, then housing them together. Presumably 
no extraneous environmental factors differentiated the lots, only the 
hormone treatments being responsible for the population difference in 
means. The designs of many group comparisons leave these matters in 
doubt. If lots are kept in separate cages, or if two groups of pots are 
affected by positional differences in the greenhouse, the indicated popula- 
tion difference may be a combination of treatment and other environ- 
mental effects and s 2 may be a biased estimate of a 2 . The treatment dif- 
ferences are then said to be confounded with the otheis, and there may be no 
means of distinguishing them. In these circumstances, statistical evidence 
that the population averages are different may have little bearing on the 
problem of possible treatment effects. Although the statistics may lead 
correctly to rejection of the null hypothesis, and although this hypothesis 
may, in fact, be untrue, yet the treatments themselves may have no effects, 
or may tend to cause a difference opposite to that in the sample. Further- 
more, there may be no way to estimate separately the amounts or direc- 
tions of the two kinds of environmental effects, treatment and extraneous, 
the population difference being compounded of such effects in unknown 
proportions. If the experiment is conducted so as to avoid such confusion, 
a 2 will be estimated by s 2 and real treatment effects may be expected to 
lead to significant differences. 

The reader who omitted the latter part of chapter 2 may well now skip to chapter 6 for which 
all necessary preparation has been made. Sections in chapter 5 can be picked up as they are needed. 
If it is analysis of vartance that is most interesting, skip to chapters 10 and 11. 

4.4 — Size of sample. In the formula for /, given near the end of 
section 4.2, division of both numerator and denominator by n — 1 gives 


t - £\/n/(2s 2 ). 
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where s 2 is the pooled variance. If this equation is solved for n, then 

n = 2 tV/x 2 = 2 

where C is the relative standard deviation or coefficient of variation (section 
2.11). With the usual assumptions (section 2.14) one may substitute for t 
its 5% (or 1%) level then calculate the number of observations required 
in each group to produce this value of t. For discussion of changes in 
degrees of freedom see (3). 

As an example, let us estimate the number of chicks (table 4.1) in 
each group necessary to make t = 2.086 provided C 2 does not turn out to 
be larger than 811/(41) 2 = 0.482: 

n = 2(2.086) 2 (0.482) = 5 chicks per lot 

The nature of the uncertainty in making such estimates is emphasized 
by applying the formula to the data of the uniformity trial in example 4.1. 
If we did not know that these two samples were drawn from the same 
population we might estimate 

n — 2(2. 1 01 ) 2 (1 4.44) /9 = 15 pigs 

as the number in each lot ample to show a significant difference. How- 
ever, should we then draw the larger samples, we would likely get values 
of x and s 2 (known to be estimates of zero and 25 respectively) such that t 
would again be moderate in size, and the same would be true for any size 
of sample taken. But we might be misled. A hazard in applying this 
method is that we might happen to get one of those one-in-twcnty samples 
with t > 2, then decide that the population difference is not zero, though 
it really is. Estimation of sample size with consequent repetition of the 
experiment is a common and useful practice, but it must be done with the 
canny caution necessary in all inquiries about the unknown. 

EXAMPLE 4.10 — The two lots of kernels in example 4.6 had x =» 5, s 1 = 98.7 
and df. * 28. Estimate the number of kernels in each lot necessary to determine 
significance. Ans. 34 kernels. 

EXAMPLE 4.11 — If you were designing an experiment to evaluate decrease in 
concentration of nectar during flight, and if you considered the data of example 4.3 
pertinent, what number of bees per lot might you try? Ans. 2 bees, according to the 
formula, but look at t M for dj = 2. 

EXAMPLE 4.12 — Compare the formulas of sections 2.14 and 4.4. What theorem 
would enable you to infer either from the other. 

4.5 — Groups with different numbers of individuals. There is no 
necessity that the two groups be of the same size. In much experimentation 
it is inconvenient or impossible to provide equal numbers of individuals. 
Two lots of chicks, for example, from two batches of eggs differently treated 
would nearly always differ in numbers of birds hatched. Furthermore, 
accidents and death usually befall the lots unequally. This may not 
change the statistical theory, and causes only a slight alteration in the 
method of making the comparison. 
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As an illustration, consider the gains in weight of two lots of rats fed 
different levels of protein, table 4.4. If n x and n 2 are the numbers of 


TABLE 4.4 

Gains in Weight of Two Lots of Female Rats on Different Diets The Gains 
Occurred Between 28 and 84 Days of Age 


Diet 

Number 
of Rats 

Gains in Grams 

High protein 

12 

134, 146, 104, 119, 124, 161, 107, 83, 113, 129, 97, 123 

Low protein 

7 

70, 118, 101, 85, 107, 132, 94 


observations in the two groups, the corresponding degrees of freedom are 
tti — 1 and n 2 — 1, a total of n\ + n 2 — 2. The variances of the two group 
means are now different, s 2 /ni and s 2 /n 2 , while the variance of the differ- 
ence of the means, the sum of the two variances, is 

s 2 /ni + s 2 /n 2 = s 2 (i/ni + l/«*) = ^ — — 

n x n 2 

so that finally, 

^2 ni + _^2 __ s / ni + n 2 
ii\fi 2 \ n\n 2 

If n i = Si becomes, as it should, the same as that in table 4.3. The 

necessary computations are laid out in table 4.5. 




TABLE 4.5 

Summary of Statistics for Comparison of Two Groups of Different 
Sizes. Rat Data of Table 4.4 


Diet 

Number 
of Rats 

Degrees of 
Freedom 

Mean Gain (grams) 

Sum of Squares 

High 

12 

11 

120 

5,032 

Low 

7 

6 

101 

2,552 



Sum *17 j 

Difference — £ = 19 

Sum « S* 3 - 7,584 


Pooled variance *= j* * 7,584/17 446.12 

Si *= \Zs*(ni -p n 2 )/tnn 2 «* \/ 446 12(12 -f- 7)/ (1 2) ( 7) = 10.04 grams 
I - 19/10.04 - 1.89, </./. * 17, P * 0.08. 


Since P > 0.05, the null hypothesis need not be rejected. So far as 
this small sampling is competent, the levels of protein fed may not differ- 
entiate rat gains into two populations. 
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The process of testing outlined above for groups of different sizes may 
be condensed into this formula: 


s/ 


nintini + ri2 — 2 ) 

+ n 2 )Sx 2 ’ 


as before, being the difference between the group means and Sx 2 the 
pooled sum of squares. Applying this formula to the rat gains: x = 120 
— 101 = 19 grams, n x n 2 = (12) (7) = 84, n x + rc 2 = 19, n x + n 2 — 2 = 
17, and Sx 2 = 5,032 + 2,552 = 7,584. Upon substituting these values 
in the formula, t — 1.89 as in table 4.5. 


EXAMPLE 4.13 — The following are the rates of diffusion of carbon dioxide 
through two soils of different porosity (9). Through a fine soil (F): 20, 31, 18, 23, 23, 
28, 23, 26, 27, 26, 12, 17, 25; through a coarse soil (C): 19, 30, 32, 28, 15, 26, 35, 18, 
25, 27, 35, 34. Show that pooled r* = 35.83, a = 2.40, d.f. = 23, and t « 1.67. The 
difference, therefore, is not significant. 

EXAMPLE 4.1 4-— The total nitrogen content of the blood plasma of normal albino 
rats was measured at 37 and 180 days of age (11). The results are expressed as grams 
per 100 cc. of plasma. At age 37 days, 9 rats had 0.98, 0.83, 0.99, 0 86, 0.90, 0.81, 
0.94, 0.92, and 0.87; at age 180 days, 8 rats had 1.20, 1.18, 1.33, 1.21, 1.20, 1.07, 1.13, 
and 1.12 grams per 100 cc. Compute t = 8.2. 

EXAMPLE 4.15— Pearson and Catchpole (8) reported the means and standard 
errors of inorganic phosphorous per 100 ml. of blood serum as determined from 26 
Pcrcherons and 18 Shetlands, 3.29 =*= 0.27 and 3.96 =*= 0.40 mg. To test the signifi- 
cance of the mean difference, 0.67 mg., compute the pooled sum of squares, 96.345 
(see example 4.8); Si 338 0.464 and / = 1 44. The difference is not significant, but may 
be easily a consequence of sampling variation. 

EXAMPLE 4.16 — It is reported that the mean peiccntage of bacon from the 
caicasses of 15 Poland China sows was 6.59% with standard error, 0 300%. If you 
assume normal distribution, you can estimate the lowest and highest percentages in 
the sample as 4.6% and 8.6%. Would you have any reason to suspect the hypothesis 
of normality? 


4.6- — Null hypotheses and the validity of their tests. Occasionally 
there is reason to believe that the variances of the two treated popula- 
tions are not the same. If they are not, the /-tests in tables 4.2 and 4.4 
are nevertheless exact, but the null hypothesis being tested is this: the 
two samples {groups ) are randomly drawn from the same normal population; 
or in other words, the normal populations sampled have the same means and the 
same variances. In this situation, a significant value of t may be attributed 
to a difference of means or a difference of variances or both. 

Interest in an experiment may be enhanced by evidence that two 
treatments produce different variances. For instance, the following 
statistics were calculated from the data on nectar concentration in the 
honey bee, example 4.3: 


Concentration 
of Syrup 


Mean 

Decrease 


Standard 

Deviation 


Relative Standard 
Deviation, C 


33% 

40% 


20 % 

65% 


0 5 

1 9 


0 165 
0 767 
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Not only are the population variances undoubtedly different (section 10.13 
provides a test) but even the coefficient of variation is greater in the group 
with the larger mean. This kind of information may be more valuable 
than that about the difference between the means. 

Despite this possibility, the fact is that the majority of investigators 
wish to know if their treatments affect the means of their population, 
irrespective of variance . They wish to test the hypothesis, mi — mi = m 
= 0, whether or not the samples are from populations with identical 
variance. If the hypothesis of equal variances is true, then the /-tests 
given above are exact tests of the hypothesis, m = 0. If there is reason 
to suspect that the variances are not equal, what shall be done? The 
answer is somewhat complicated, and will be given in three parts. 

1. There arc two classes of investigators who use tests of significance. 
Most of the readers of this book are among those who look upon such a 
test as an integral part of the evidence upon which decisions are based, 
but often as only a small part. For the moment, these may be called 
casual testers. Then there are routine testers who, planning a scries of 
trials, adopt the invariable rule: reject the hypothesis if the probability 
is less than 0.05 (or 0.01). For these latter, it is essential to have the 
probabilities evaluated as precisely as possible, otherwise the proportion 
of rejections may be biased. They will be especially interested in refer- 
ences (1), (6), and (12), which are discussions of modifications in the 
testing appropriate for heterogeneous variance. 

2. For the purpose of testing the means of two groups in which n x = n 2 , 
the casual tester may, with one exception, ignore any difference in the vari- 
ances of the groups. Exception: enter t-table with dj. — n — \ instead of 
2 (n — 1). With this exception, values of probability near the 0.05 level, 
got by the procedure of table 4.3, are likely affected by unequal vari- 
ances no more than a point or two in the third decimal place. Deci- 
sions about the means, based on all the evidence, will be unchanged by 
heterogeneity of variance if n x = rz 2 . 

3. If t?i and n 2) along with the corresponding variances, are greatly 
different, the casual tester may wish to take advantage of an easily 
applied approximate method proposed by Cochran and Cox (3). Since 
the procedure is somewhat different from that of table 4.5, it is illustrated 
in tables 4.6 and 4.7. The purpose is to test the hypothesis, m — 0, with 
no hypothesis about the population variances. The sums of squares are 

TABLE 4.6 


Decrease in Concentration of Nectar in Honey Sack of Bee Between Feeder 
and Hive. Two Days in September, 1931. From O. W. Park (7) 


Day in 

Number 

1 

September 

of Bees 

Decrease in Percentage Concentration 

3 

20 

1.0, 0.9, 0.8, 1.0, 0.8, 1.1, 0.6, 0.4, 0.4, 0.7, 0.9, 0.5, 0.9, 



0.3, 0.3, 0.8, 0.6, 0.4, 1.0, 0.6 

12 

10 

2.5, 0.6, 1.1, 0.6, 1.8, 0.6, 1.2, 1.2, 0.4, 1.0 
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TABLE 4.7 

Approximate Method of Testing the Hypothesis, m » 0, With 
No Hypothesis About Variance 



Number 

1 

Degrees of 

Mean 

Percentage 


* s * 

5 dj- 


=* s 2 /n 



Date 

Bees 

Freedom 

Decrease 


Sx 2 

t 

06 

9- 3 

20 

19 

1 1 

1 24 

0 0653 

0 

00326 

2 

093 

9-12 

10 

9 

0 7 

3 72 

0 4133 

0 

04133 j 

2 

262 


30 

28 

j =0 4 

1 

1 

_ J 

0 

04459 j 




V0.04459 - 0.211 x / s x - 0.4/0.211 - 1 90 

(0 00326) (2.093) + (0.04133)(2 262) 


5% level = 


0 00326 + 0 04133 


2 25 


not pooled but the variance of each mean is calculated separately. The 
variance of the mean difference is, as usual, the sum of these variances. 
But the familiar ratio, x/s x , is not now distributed like t with dj . — 28, 
so cannot be compared to 1 05 = 2.048. Instead, an approximation to 
the 5% level of this ratio is computed as the weighted mean of the two 
/-values for dj. = 19 and dj. = 9, the weights being the two variances, 
s^. That is, the sum of the products of the two pairs of numbers in the last 
two columns is divided by the sum of the variances. Since the ratio, 
x/st = 1.90, is smaller than its 5% level, 2.25, the hypothesis, m = 0, 
is not likely rejected. On the contrary, if the ordinary /-test is used to test 
the hypothesis that the samples were drawn from the same normal 
population (section 4.5), then 


/ 



( 20 ) ( 10 ) ( 28) 
(30) (4.96) 


2.45, dj. = 28, 


and this hypothesis would presumably be rejected. The variances of the 
populations are evidently different (for a test, see section 10.13), but the 
means may not be. 


4.7 — Individual vs. group comparisons. The decision as to whether 
an experiment should be set up for individual or group comparisons is one 
of considerable import. It may even determine the success or failure of the 
trial. While there is no universal formula for determining the preferable 
design, there are a number of relevant circumstances on which judgments 
may be founded. Some of these follow. 

Pairing is often indicated clearly in the experimental material. Crop 
yields in successive seasons tend to be so different that no one would 
think of comparing the results of one treatment last year with those of a 
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second treatment this year. Both treatments must be applied every season 
with a consequent pairing of the yields. Much the same can be said of 
different localities in the same year. Again, in the many experiments 
involving two measurements on the same individuals the measurements 
are nearly always paired. An example is the weighing of animals in two 
successive months in order to determine gain in weight. 

On the other hand, pairing is not practicable in a variety of experi- 
ments. Among Breneman’s chicks (section 4.2) there was no basis for 
matching individuals, hence, they were randomly assigned to the treat- 
ments. When yearling steers are shipped in from the range for fattening, 
so little is known of their history that estimating outcome seems to be 
doubtful. Again, if circumstances dictate different numbers of individuals 
for the two treatments, pairing is clearly impossible. 

Aside from these rather obvious situations the design often depends 
upon the investigator’s information about his experimental material and 
its environment. He may know, for example, that potted plants on two 
greenhouse benches will grow alike if treated alike. He could then apply 
one treatment to the pots on one bench and a second treatment to the 
others, then compare the two treatments by the methods of this chapter. 
But more often he will know that growing conditions on the two benches, 
and even on adjacent portions of the same bench, are different. He will 
avoid the effects of this variation by placing side by side pairs of pots with 
the two treatments, then replicating the pairs. The appropriate statistical 
method will be that of chapter 2. 

With increasingly precise experimentation in any field, resulting from 
more exact knowledge of the behavior of the experimental material, 
group comparisons are likely to be replaced by those of individuals. In 
feeding trials with growing pigs, animals of the same litter, sex and initial 
weight arc paired because it has been learned that they are more likely 
to gain alike on the same ration than are two randomly assorted indi- 
viduals. Hence, if from each pair one member, chosen at random, is 
given one feed while the other member receives the second, the differ- 
ences in gain tend toward uniformity. This means that the experimental 
error, calculated as in chapter 2, is expected to be smaller than that in a 
group comparison where the pigs are allotted to the rations at random. 

In such experiments the success of pairing depends upon knowledge 
of outcome. No advantage will be gained unless the experimenter can 
predict with some degree of success that the members of a pair have the 
same performing ability, this common ability being different from that of 
the members of other pairs. 

A feature of the method of individual comparisons is that it enables 
the investigator to utilize all the knowledge he has about the behavior 
of his experimental material in the contemplated environment. In the 
absence of this knowledge of outcome, or of any pertinent relationship, 
group comparisons are more appropriate. 
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The beginner may learn to avoid pitfalls by observing the effects of 
applying an unwarranted statistical method to experimental data. Sup- 
pose, for example, the reduction of the paired virus data in section 2.13 
had been mistakenly carried out by the method for group comparisons. 
The results would have been these: 


Preparation 

Number 

Degrees of 

Mean Number 

Sum of 

of Virus 

Half Leaves 

Freedom 

of Lesions 

Squares 

■ 

1 

8 

7 

15 

468 

2 

8 

7 

11 

172 


Sum = 14 x — 4 Sx* = 640 


s 2 = 45 71, t — 4\ / (8) (7)/640 = 1 18, P = 0 26, appioximately 


The investigator might have decided not to reject the null hypothesis, 
whereas he probably would have rejected it on the basis of the correctly 
computed statistics in table 2.2. This discussion of misapplication of 
statistical methods should be compared with that of section 3.10 in which 
the experimental design was ignored. The results are identical, though 
differently evaluated. 

Equally surprising effects follow an arbitrary pairing of data from an 
experiment designed for group comparison. When you remember that 
this design would be adopted only if the investigator were unable to 
foresee any advantage in pairing his experimental individuals, you will 
appreciate the mistake of arranging the randomly assembled measure- 
ments in pairs after the experiment is done. Perhaps the best way to 
illustrate is by use of the first two samples of table 3.3. Since these are 
two groups randomly drawn from the same population, the proper sum- 
mary is as follows: 


Sample 

Number of 

Degrees of 



Number 

Observations 

Freedom 

Mean 

Sum of Squares 

1 

10 

9 

35 6 

9(169 8) = 1 ,528 2 

2 

10 

9 

29 3 

9(151 6) = 1 ,364 4 

Degrees of freedom = 18 

\x = 6 3 

>S* 2 = 2,892 6 


j 2 = 2,892 6/18 = 160.7, t = 6.3v9(f0)/2,892.6 - 1.11, P - 0.28 


The sample difference, 6.3, and variance, 160.7, are unbiased estimates 
of the population m = 0 and a 2 = 100. Furthermore, as you would 
expect in 95% of such comparisons, t is less than its 5% value. But 
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suppose someone had rearranged the items in the two samples, pairing 
the two smallest, the next smallest, and so on to the two largest as in the 
first part of table 4.8; or else pairing small with large and vice versa as in 
the second part of the table. While the means of the samples and of the 


TABLE 48 

Two of the Possible Methods of Pairing the Data in Random Samples, 
Illustrating the Erroneous Conclusions That Might Be Drawn 




Method I 



Method II 



Sample 1 

Sample 2 

Difference 

Sample 1 

Sample 2 

Difference 


12 

11 

1 

12 

53 

— 41 


24 

19 

5 

24 

44 

-20 


29 

19 

10 

29 

32 

- 3 


33 

24 

9 

33 

31 

2 


34 

30 

4 

34 

30 

4 

1 

36 

30 

6 

36 

30 

6 


39 

31 

8 

39 

24 

15 


39 

32 

7 

39 

19 

20 


53 

44 

9 

53 

19 

34 


57 

53 

4 

57 

11 

46 

sx 

356 

293 

63 

356 

293 

63 

X 

35 6 

29 3 

6 3 

35 6 

29 3 

6 3 

X 2 

8 01 

627 34 

t 

7 04 

0 80 


differences are unchanged, the variances of the two sets of differences 
form a notable contrast, neither being unbiased estimates of the variance 
of random differences, <r = 200, in the population. In the first method, 
the large value of t would certainly lead to rejection of the hypothesis 
of zero population difference. In the second, t is an underestimate of the 
correct sample value, 1.11. It is clear from these illustrations that the 
statistical method fixed by the experimental design cannot be tampered 
with. 

4.8 — Statistics and the experiment. It will be increasingly evident 
that the statistician and the experimenter, who are usually the same person, 
must form a close cooperation to insure the success of an investigation. 
The experimenter specifies the conditions in which the trial is to be 
performed — materials and treatments, together with genetic and environ- 
mental circumstances, and the measurements that can be made. The 
statistician selects or invents a plan (experimental design) which will 
furnish unbiased and unconfounded estimates with adequate precision. 
The experimenter conducts the laboratory or field work, taking pains to 
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eliminate as nearly as possible all extraneous effects. If he is successful, 
the ensuing measurements will contain the information for which the 
experiment is set up. The statistician uses appropriate methods for ex- 
tracting all the information wrought into the data. Finally, the experi- 
menter interprets this information in the light of existing knowledge in 
his science. 

It is sometimes not understood that statistical methods can bring out 
only that information which has been incorporated into the data by care- 
ful design and execution of the sampling. Elaborate statistics are no 
substitute for meticulous experimentation. Population inferences are 
futile if dependent on carelessly collected data. It is equally often over- 
looked that extensive and conscientiously done measurements may con- 
tain little worthwhile information if the experimental design is faulty. 
It is only by a combination of appropriate design, skillful conduct of the 
experiment and suitable statistical methods that the investigator is 
assured of reliable evidence upon which to base his decisions. 

4.9 — Summary. Emphasis has been laid on the place of design in the 
conduct of any sampling. If the conditions are little known, the group 
comparisons of this chapter may be used. As the investigator’s knowledge 
of experimental material and environment is enlarged he may utilize 
designs, such as those in chapter 2 and others to follow, that tend to 
eliminate from experimental error extraneous sources of variation. 

In this chapter have been described statistical methods for group 
comparisons suitable to groups of both equal and unequal sizes. Un- 
biased estimates of differences between means and their variances are 
provided, followed by tests of hypotheses and fiducial statements. 
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Chapter 5 

SHORT CUTS AND APPROXIMATIONS 


5.1 — Standard method for computing standard deviation. When 
you were learning to compute standard deviations you were advised to 
carry no more digits in the deviations than were contained in the observed 
values. This led to small inaccuracies in the results, not serious in the 
foregoing computations. However, as the extent and complexities of the 
calculations increase, it becomes necessary to make them more precise. 
This is done in a very simple manner, by introducing a correction of just 
the right amount to compensate for the neglected digits. The method 
is now to be explained. 

For illustration, take the birth weights of 20 guinea pigs from example 
2.13. They are listed in the lefthand column of table 5.1 . It is unnecessary 
to bother about computing the mean; just glance over the set of items and 
choose an arbitrary origin, G, near the middle of the array. Since 30 grams 
is near the mean, it will be chosen now for the first calculation. However, 
it will soon appear that 29 grains, or 40 grams, or any other arbitrarily 
selected number leads to exactly the same result after the proper corrections 
for origin are made. 

The sum of the deviations from 30 is not zero, proving that 30 grams 
is not the mean of the set of birth weights. But the mean of these devia- 
tions. 0.25 gram, is just the necessary correction for mean , the amount that 
must be added to the arbitrarily chosen origin to arrive at the correct 
value of the mean, 30.25 grams. In example 2.13, the mean was calcu- 
lated in the usual way by adding the column of weights and dividing by 
20. By the short-cut method, the addition of the weights has been avoided. 
Perhaps a better attitude to take is that a check on the computations is 
made available. If the mean is calculated by two methods instead of 
one, the identity of the results proves the arithmetic. 

In the last column of the table are the squares of the deviations from 
30, together with their sum, 247. What we require is Sx 2 , the sum of 
squares of deviations from mean, usually abbreviated to sum of squares . 
The correction for sum of squares is made by the simple deduction of 
[£(.<¥ — G)] 2 /n — 5 2 /20 = 1.25. The sum of squares, then, is 247 — 1.25 
= 245.75. This may be verified by computing the 20 deviations from 
mean, 30 — 30.25 = —0.25, etc., and adding the resulting squares. A 
less tedious exercise will be provided in example 5.4. The mean square 
(variance) and standard deviation are now calculated in the usual style. 

Some people, unaccustomed to computation, confuse two of the quan- 
tities required in the foregoing calculations. First, there is the sum of the 
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TABLE 5.1 

Computation of Standard Deviation by^the Use of an Arbitrarily 
Chosen Origin of Measurement 
Birth weights of 20 guinea pigs in litters of two 


Birth Weights 


Squares of 

(grams) 

Deviations From 30 Grams 

Deviations 

X 

X - G 

(.Y - GY 

30 

0 

0 

30 

0 

0 

26 

-4 

16 

32 

2 

4 

30 

0 

0 

23 

-7 

49 

29 

-1 

1 

31 

1 

1 

36 

6 

36 

30 

0 

0 

25 

-5 

25 

34 

4 

16 

32 

2 

4 

29 

-1 

1 

28 

— 2 

4 

27 

-3 

9 

38 

8 

64 

31 

1 

1 

34 

4 

16 

30 

0 

0 

Total 

-23 28 

247 


S(X - G) = 28 - 23 - 5 S(X - <?)* - 247 

[S(X - G)]/n - 5/20 - 0 25 [S(X - G)\*/n - 5*/20 - 1 25 

f •» G + t^(A r - G)l/n ,5V - 245 75 

- 30 -h 0 25 j* - 245 75/19 - 12 93 

= 30 25 grams s = 3 60 grams 


squares of deviations from G , £(/¥ — G) 2 = 247. Second, there is the 
square of the sum of the same deviations, [S(X — • £)] 2 = 25. In the first 
the squaring is done before the addition, while in the second the order of 
the operations is reversed. The objectives and results are so different that 
it is only necessary to direct attention to the speciousness of the analogy. 

EXAMPLE 5.1 — Carry through the computations in tabic 5.1 with 25 grams as 
the arbitrary origin. The deviations will be 5, 5, 1, 7, 5, —2, etc. Verify these results: 
(i) S(X — 25) = 1 05 grams; (ii) S(X — 25) 2 ** 797; (iii) correction for mean * 5.25 
grams; (iv) correction for sum of squares — 551.25. The mean and sum of squares 
will be 30.25 grams and 245.75 as before. 

EXAMPLE 5.2 — Make the computations in table 5.1 with 35 grams as the origin. 
The deviations are —5, —5, —9, —3, etc. The correction for mean is — 95/20 
== — 4.75, so that the mean is 35 4- ( — 4 75) = 35 — 4.75 = 30.25 grams as before. 
The addition is performed algebraically as usual. The correction for sum of squares 
is ( — 95) 2 /20 451.25, a positive number. This correction, then, is deducted from 

S(X - 35) 2 . 



Section 5.2 


Short Cuts 


91 


EXAMPLE 5.3 — In table 5.1, if you use the arbitrary origin 23, or more conven- 
iently 20, you will have no negative deviations. Is there any advantage in computing 
with deviations which are all positive? 

EXAMPLE 5.4 — Here is an easy set for practice: 1, 4, 4, 7, 7, 8, 10, 10, 10, 12, 15. 
Verify the fact that the mean is 8 and the standard deviation, 4. Use various arbitrary 
origins, continuing until the process of calculation can be carried through without 
hesitancy. Try G = 0. 

EXAMPLE 5.5 — In section 3.5 it was suggested that you compute s in each of 
the samples you had drawn. You can now get the exact sample values by making the 
corrections in S(X — G ) 2 . 

EXAMPLE 5.6 — What evidence do you have that this theorem is true? The sum 
of squares of deviations from mean is less than the sum of the squares of the deviations 
from any other origin. 

EXAMPLE 5 7— Might the correction for sum of squares be calculated just as 
easily by the quantity, [S(X — G)]Z? In words, might the phrase, “sum times mean,” 
be substituted for “sum square over number”? Which do you prefer? 

EXAMPLE 5.8 — Those who enjoy algebraic manipulations will wish to derive the 
formula, Sx 2 = £(A r — G) 2 — [^(A' — G)J 2 /n. Start with the relation, x *« X — £ 

- (A - G) - (Jr - G) - (X - G) - S{X - G)/n. Either transpose the term, S(X 

— G) / n , then square and sum, or square and sum the equation as it stands. 

5.2— Special methods for use with calculating machines. The 

methods which have been described were devised to confine the computa- 
tions, so far as possible, to small integers. If a calculating machine is 
available, large integers can be manipulated almost as easily as small 
ones. Other objectives, therefore, may be set up. One is the avoidance of 
negative numbers. Another is the elimination of most of the subtractions. 
Both of these objectives are easily attained, the resulting method being 
especially adapted to machine calculation. Here it is. 

Returning to the weights in the first column of table 5.1, their sum, 
SX = 605, and the sum of their squares, SX 2 = 18,547, are readily 
calculated with any machine. The details of the operations differ with 
the style of machine used. Usually the two sums are run up simultaneously, 
the appropriate method being described in the manual of instructions 
issued by each manufacturer. With any machine it is unnecessary to 
copy down the squares of the individual weights; the sum is accumulated 
in the machine as the operation proceeds. At the end, it is only necessary 
to copy the results from the dials into a table like 5.2. The mean, of 


TABLE 5.2 

Method of Computing the Mean and Sum of Squares if a 
Calculating Machine Is Used 
Birth weights of 20 guinea pigs as recorded in table 5.1 


n - 20 
SX « 605 
f = 30.25 grams 


SX* = 18,547 
(SX)*/n « 18,301.25 
Sx 2 - 245.75 
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course, is (SX)/n = 605/20 = 30.25 grams. The correction for sum of 
squares is (SX) 2 /n = (605) 2 /20 = 18,301.25. Deducting this from SX 2 , 
the corrected sum of squares is 245.75 as before. 

This method of machine calculation may be thought of as a special 
case of the general method described in section 5.1. Now, G = 0, 
the arbitrary origin being the same as the zero point on the scale of 
weights. The deviations, X ~ G = X — 0 ~ X, are the same as the 
observed values. The correction necessary to obtain the sum of squares is 
[*S , (A r — G)] 2 /n = [£(A' — 0)] 2 /n = (SX) 2 /n. Not only is the machine 
method a timesaver, but, obviating as it does the subtractions associated 
with deviations, it affords less chances for error. 

EXAMPLE 5.9 — Using the machine method, verify x and s in examples 2 6 
and 2.14. 

EXAMPLE 5.10 — Using the machine method, verify £ and s in example 2.24. 

EXAMPLE 5.11 — If a machine is used for calculating the standard deviation of 
the differences in table 4.2, what change in the operation would be required by the 
negative differences? 

EXAMPLE 5.12— Might the correction for SX 2 in table 5.2 be calculated just as 
easily from the phrase “sum times mean” (example 5.7) ? In symbols, is (SX) 2 /n 

- (SX)£? 

EXAMPLE 5.13 — It is easy to derive the formulas, Sx* = SX 2 — (SX) % /n — SX 7 

— xSX — SX 2 — ni 2 . Start with the relation, x -f x *= X. 

EXAMPLE 5 14 — In example 2.6 were recorded the differences in yield on 16 
pairs of plots in 1933 and 1934. In the three preceding years there were 25 other pairs, 
with differences 9.0, 1.4, 4 1, 0 0, 6.3, 4 6, 1 1, -1.2, 2 0, 2 7, -1 6, 6.3, 4.5, 9 1, 
0.4, —0.2, 2.8, 8 6, 4.1, 6 0, 8 1, 10.6, 6 2, 8 7, 8.4 bushels per acre Compute the 
mean of the 41 differences, 4.18, and the standard error, 0.615 bushels per acre Is 
the difference significant? 

5.3 — Deviations as code numbers. Even when a calculating machine 
is available, it is occasionally desirable to use deviations from some easily 
subtracted arbitrary origin. As an example, consider the weights of 10 
tomatoes: 206, 217, 224, 227, 228, 231, 236, 241, 245, 258 grams. It is 
easy to subtract 200 grams from each item without rewriting the set. The 
machine is used to run up SX =313 and SX 2 = 11,741 for the deviations, 
6, 17, . . . 58. The mean, then, is 200 + 313/10 = 231.3 grams, the sum 
of squares being 11,741 — (31 3) 2 / 1 0 = 1944.1. There is a worthwhile 
saving in the size of the numbers, with little chance of error in making 
the subtractions. 

The deviations from 200 are sometimes called code numbers. In the 
computations they are used instead of the observed values in order to 
facilitate the operations. If deviations were the only kind of code numbers, 
there would evidently be no advantage in having a second name for them. 
A rather obvious application of another kind of code number is shown in 
table 5.3. For present purposes, the hydrogen ion concentrations (pH) 
are ignored. The readings in the second column are coded by: (i) multi- 
plication by 10,000, and (ii) subtraction of 200. The computations of ot 
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and s are done with code numbers. The resulting mean must first be 
increased by 200 then divided by 10,000, these inverse operations* being 
applied in reverse order to the coding operations. You will notice that s 
is not affected by the subtraction of 200; but it, like the mean, must be 

TABLE 5.3 

The Use of Code Numbers in Calculating Mean and Standard Deviation 
Cyanogcnetic nitrogen recovered from peach stems (2) 


Initial 

pH 

Grams per 50 Grams of Tissue 

X' 

Code Numbers 

X = 10,000a' - 200 

5 1 

0 0238 

38 

5 0 

0 0238 

38 

5 3 ; 

0 0241 

41 

5 3 

0 0241 

41 

5 7 

0 0250 

50 

5 7 

0 0247 

47 

6 0 i 

0 0241 

41 

6 1 

0 0238 

38 

6 6 

0 0226 

26 

6 5 

0 0232 

32 

n - 10 

SX - 392 

SX 9 - 15,784 
{SXY/n = 15,366 4 

s = 6 81 code numbers 
= 0 000681 gram 


I — 39 2 (ode numbers Sx 2 = 417 6 


0 02^92 gram P = 46 4 


compensated for the coding operation of multiplication by 10,000. Coding 
by multiplication or division may be thought of as a change in the unit 
measurement. If it is easier to carry on the calculations with a unit of 
one ten-thousandth gram, little chance of error is introduced by the 
change. At the end, the original unit is recovered by a mere shift of the 
decimal point. 

5.4 — True coding. The really practical use of code numbers arises 
from a custom general among researchers. Both from necessary precau- 
tion and from convenience they are prone to collect their data in finer 
units than are suitable to statistical treatment. A good example appears 
in the report of a survey of 145 Iowa farm families with the object of 
learning about their habits of food consumption (6). Naturally, the record 
of the annual consumption of meat was in pound units, that of the first 
20 families being 726, 296, 928, 668, 287, 1206, 517, 1638, 2414, 610, 
494, 2489, 1198, 676, 1302, 440, 1247, 1053, 1029, 218 pounds. The 
range is 2489-218 = 2271 units. In most statistical investigations a range 
somewhere between 20 and 40 units is ample. It turns out, then, that the 
hundredweight is a more suitable computing unit for the set above, the 
appropriate code numbers being derived from dividing each item by 100, 

* Addition and subtraction arc inverse operations; so arc multiplication and division. 
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then rounding: 7, 3, 9, 7, 3, 12, 5, 16, 24, 6, 5, 25, 12, 7, 13, 4, 12, 11, 10, 
2 cwt. The statistics of this coded sample are, x — 9.65, s = 6.36 cwt. 
Of course, these must be multiplied by 100 to recapture the original unit, 
so that x = 965 pounds and s = 636 pounds. 

In rounding, numbers like 13.51 are increased to 14, while such as 
13.49 are lowered to 13. The dividing line being 13.50, what shall be done 
if this particular number appears? In order to insure increases and de- 
creases in approximately equal proportions a good rule is to round to 
the nearest even number. Thus, 13.50 and 14.50 would each be rounded 
to 14. 

You will immediately wish to know whether any valuable information 
has been lost owing to the rounding of the code numbers. The statistics 
of the original sample are x — 972, s = 637 pounds. Is the discrepancy 
disturbing? Consider the standard error of the mean, Si = 637/\/20 = 142 
pounds. 

EXAMPLE 5.15 — The dry weights of clippings from five bluegrass plants (4) 
for the week ending July 7 were 1 .2, 0.9, 1 .0, 1 .4, 0 9 grams. Before computing f =» 1 .08 
and s = 0 217 grams, imagine each item multiplied by 10, that is, ignore the decimal 
points. The correct results are got from dividing the coded statistics, £ and j, by 10. 

EXAMPLE 5.16 — Do you have a name for such quantities as SX 2 and iJX — G )*? 
You can’t use “sum of squares” because that is reserved for Sx 2 . You might say “sum 
of squares of the observations” and “sum of squares of deviations from G.” 

Here are three examples for those interested in the algebra of the preceding and 
following sections. 

EXAMPLE 5.17 — If Y = A -f A r , A being constant, prove that y — A -f x and 
Sf « .SjA 

EXAMPLE 5.18 — If Y = bX (b is another constant), prove that y = hx and 
Sjy 2 =* b 2 Sx 2 . You won’t overlook the fact that b may be a fraction. 

EXAMPLE 5.19— If Y - A + bX , prove that y - A + b£ and Sf « b 2 Sx 2 . 

5.5 — Rules and precautions about code numbers. The objects of 
coding are to promote accuracy and save labor. Be sure that these 
advantages are great enough to more than counterbalance the time and 
risk of error involved in coding. 

Every item in a set must be treated alike. If you were to add one 
number to some of the items and a different number to the others, the 
results would be meaningless. 

The mean is affected by every coding operation. As an example, if 
items are coded by first subtracting 100 then dividing by 10, the coded 
mean must be first multiplied by 10 then increased by 100, the inverse 
operations being applied in reverse order. 

The standard deviation is affected by only multiplication and division. 
It should be clear that addition and subtraction do not affect measures 
of variation such as range or s ; such coding merely shifts the origin of 
measurement without contracting or expanding the unit. On the other 
hand, multiplication and division change the unit of measurement and 
must be compensated by applying the inverse operations to the coded 
statistics. 
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The precision of the final result is not affected by addition, subtraction, 
multiplication or division, but is influenced by dropping off digits and 
rounding. 

The inference was made in section 5.4 that a range somewhere between 
20 and 40 units is desirable. The reference was to code numbers, since 
they do not ordinarily involve decimals. A more general statement is 
that the range when expressed in significant digits should lie somewhere 
between 20 and 40. Significant digits (the term has no technical kinship 
with that employed in sections 1.13, 3.9, and 4.2) are independent of the 
position of the decimal point. The significant digits are the same in the 
numbers 647, 0.0647 and 6.47. If the range in one set is 0.067 — 0.031 
— 0.036 gram, and in another, 23.1 — 19.5 = 3.6 pounds, the ranges in 
significant digits are the same, 36. When considering the number of 
digits to retain for computation, the decimal point may be ignored until 
it is time to make a record of the plan selected. 

EXAMPLE 5.20 — In example 6 33 are given percentages of water in apple twigs 
and corresponding measurements of specific heat. If you have a calculating machine 
available, compute £ and s for each character. Note especially that you cannot round 
off the last digit in either column. If you were to do so, the range of rounded numbers 
in column 2 would be only 65 — 49 = 16, which is less than desirable. Of course, 
you may omit the decimal point, then subtract 400 from each percentage if you wish. 
With a machine, however, the saving in time is questionable unless you are accurate 
enough to carry on the coding while operating the machine and without rewriting the 
column. Try it. 

5.6 — Significant digits. “How many decimals should be carried” 
is a question often asked. In that form there is no definite answer. When 
the question is put in the form, “How many significant digits should be 
carried?” then answers are possible though various. The distinction, 
pointed out in the preceding section, is made clearer by the following 
example. A weighing is made on a balance read directly to fifths of a 
gram, but estimated to tenths. The weight is 25.3 grams. But this result 
might be recorded as either 25,300 mg. or 0.0253 kg. In any of the three 
forms the significant digits are 253, quite unaffected by the unit of measure- 
ment. Even when the question above is limited to significant digits, how- 
ever, there is an amazing array of considerations affecting the number of 
figures carried. It seems quite impossible to reduce them to rules. 
Throughout this text we have endeavored to display good form in han- 
dling numbers. At various points we have commented on unusual compli- 
cations. In this section, we shall attempt little more than to suggest some 
trains of thought and some experiments in calculation. 

If 25.3 g. is the dry weight of some forage plant grown in a pot, it may 
be necessary to convert the figure into pounds per acre. The conversion 
factor might be 94.327 pounds per acre per gram. How many significant 
digits should be carried in the product? Remembering that 25.3 grams 
really is an estimate of some weight between 25.25 and 25.35, the number 
of pounds per acre may be anywhere between 

(25.25) (94.327) = 2,381.8 lbs. per acre 
(25.35) (94.327) = 2,391.2 lbs. per acre 


and 
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The mean of these two products is equal to (25.3) (94.327) = 2,386.5. 
Three facts are clear: (i) even the third significant figure may not be 
right, but (ii) it takes four significant figures 2,386 to estimate the product 
satisfactorily, and (iii) the factor with the fewer significant figures limits 
the accuracy of the product — the weakest-link-in-the-chain idea. It is 
easy to verify the fact that the product of two three-figure measurements 
is even less accurate than the result considered above, but that the fourth 
figure still gives some information about the average value. A good rule 
in multiplication, then, is to carry one more significant digit than those 
in the shorter of the two factors. 

You will find by a little experimentation that the facts about quotients 
are much the same as those about products. But consider the combination 
of subtraction and division illustrated in a report of the percentage de- 
crease in amino nitrogen in a cabbage plant extract during distillation 
with calcium oxide (8). The number of milligrams before distillation was 
14.3; after, 10.8, the percentage decrease being 24.5. At the limits of 
approximation, the decrease was between 

14.35 - 10.75 = 3.60 mg. 
and 14.25 — 10.85 = 3.40 mg. 

The percentage decrease was, therefore, between 

(3.60)/14.35 = 25.1% 
and (3.40)/14.25 = 23.9% 

Thus, inaccuracy has crept into the second significant figure, all beyond 
the third being meaningless. 

In computing the sum of a number of items similar to 25.3 grams, since 
there are likely to be about as many weights too small as too large, there 
is a tendency for the excesses to cancel the defects. The mean, therefore, 
can be carried safely one or two significant figures farther than the items. 

Rules could be devised for the computations discussed above but they 
would have to be discarded promptly when a whole series of calculations 
is to be performed, as in the computation of the standard error of a mean. 
Continuous adherance to the rules would result in the introduction of 
exaggerations of inaccuracies. To be on the safe side, two or three extra 
figures must be carried, especially if calculating machines are available. 
The final results should be cut back to a reasonable number of significant 
figures. 

What is a reasonable number? The errors of measurement considered 
above are relatively insignificant when compared with sampling variation. 
As an example, a mean, 55.957 mg., may be accurate to five significant 
digits in a large sample, and its standard error may be correctly computed 
as 1.754 mg. But since the 50% semi-fiducial interval is (0,6745) (1.754) 
= 1.2 mg., the report might well be limited to 56.0 =*=1.8 mg. so far as 
inferences about the population are concerned. However, that brings up 
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another complication. If we wish to reconstruct the normal sample from 
which these statistics were calculated, in the manner of example 4.8 tor 
instance, it would be advantageous to have at least as many significant 
figures in as there are in x , the actual number depending somewhat 
upon the size of the sample (since s* is to be multiplied by y/n). 

You begin to see why one can’t follow any one set of specialized rules 
about significant digits. In simple products and ratios, it is easy to make 
the results compatible with the precision of the original measurements. 
If you are making a long sequence of calculations for the first time, carry 
along several more figures than you expect to retain in the final form. In 
preparing your report, limit all numbers to a size commensurate with their 
precision and contemplated utility. 

EXAMPLE 5.21 — In making the weighings of dressed swine carcasses, and espe- 
cially of live weights, accuracy beyond whole pounds is seldom attained. What would 
you think if you saw the report of a dressing percentage, 81.764%? 

EXAMPLE 5.22 — Suppose, in calculating the sum of squares of 15 two-figure 
items, SX = 545, so that x = 36.33. The correction for sum of squares is, then, 
(545) 2 /15 = 19,801.67. If you had used the formula of example 5.7, you would have 
got (545) (36.33) = 19,799.85. Hence, if you say “sum times mean” is the correction 
term, you must carry the mean to more significant figures than you expect to report. 
If you wish to get the foregoing correction term accurate to two decimal places, how 
many significant figures will be required in the mean? Note that this is not a question 
of sampling variation, but is merely a matter of correct calculation. The rounded off 
figures in the mean, 0.00333 . . . , are magnified 545 times by multiplication, and 
(545) (0.00333 . . .) is 1 81. If you wish to use the formula “sum times mean,” and it 
is very convenient at times, you must be sure to carry the mean to an adequate number 
of significant figures. 

5.7 — The distribution of the ratio, range/s. Turning from methods 
of computation, we introduce a topic of great utility in the common sense 
understanding of statistics. As has been emphasized, the two measures of 
variation, range and standard deviation, are not unrelated. In example 
3.1 it was suggested that you calculate the ratio, range /s, for each of your 
samples of 10. Table 5.4 is the frequency distribution of 500 such ratios 
in samples similar to yours. If you make the histogram you will be struck 
by the asymmetry of the distribution as well as by the magnitude of the 
range. You can easily find where your ratios belong in the table — which 
are near the average and which among the extremes? 

TABLE 5.4 


Frequency Distribution of range / s in 500 Samples of 10 





Mean = 318 







Class mark 

1 95 

2 05 

2 15 2 25 2 35 

2 45 

2 55 

2 65 

2 75 

2 85 

2 95 

Frequency 

I i 

l 

0 

1 0 1 

2 

8 

24 

25 

43 

44 











Class mark 

3 05 

3 15 

3 25 3 35 3 45 

3 55 

3 65 

3 75 

3 85 

3 95 

4 05 

Frequency 

58 

61 

55 45 45 

36 

27 

12 

9 

2 

1 
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If you should investigate the corresponding distributions for samples 
of other sizes, you would find that the average value of the ratio, range j s, 
tends to increase with sample size. Why? As more items are drawn, there 
is an increasing likelihood of including some of the unusual extremes 
which are in the population. Both range and standard deviation tend to 
increase with sample size, but range goes faster. 

Tippett (9) and E. S. Pearson (7) have examined the distribution of 
range /a in samples from normal populations. Tippett tabulated the mean 
values and Pearson, the 5% and 1% points for samples of many sizes: 
table 5.5 is taken from their reports. In practice, it is only range /s that is 
known, but attention is called to the close agreement between the mean 
value of range /s = 3.18 in table 5.4 and the mean of range/ a, 3.08, in tabic 
5.5. In the kinds of applications made in this text, the discrepancy is 
unimportant. 

TABLED 5. 5 

Mean Values of the Ratio, Range/ct, Together With Values That Will Be 
Exceeded With Indicated Probabiiity in Random Sampies of Size n Drawn 
From the Normal Population 


n 

Mean 

Probability 

! n 

Mean 

Probability 

05 


01 

05 

01 

2 1 

1 13 

2 77 

3 

64 

20 

3 73 

5 01 

5 64 

3 ! 

1 69 

3 34 

4 

10 

30 

4 09 

5 30 

5 91 

4 

2 06 

3 65 

4 

38 

50 

4 50 

5 64 

6 23 

5 

2 33 

3 87 

4 

59 

75 

4 81 

5 91 

6 47 

6 

2 53 

4 04 

4 

74 

100 

5 02 

6 08 

6 63 

7 

2 70 

4 18 

4 

87 

150 

5 3 

6 3 

6 8 

8 

2 85 

4 29 

4 

98 

200 

5 5 

6 4 

7 0 

9 

2 97 

4 39 

5 

07 

300 

5 8 

6 7 

7 2 

10 

3 08 

4 48 

5 

15 

500 

6 1 

6 9 

7 4 

15 

3 47 

4 79 

5 

44 

700 

6 3 

7 1 

7.6 


Although table 5.5 already has been made use of considerably, one 
additional example may be cited. Crampton (1) reported the mean area 
of rasher of 20 swine as 26.1 square inches, with standard deviation 2.8 
square inches. Most of us have only a vague notion of the range in area 
of slices of bacon. In table 5.5, opposite n — 20, the ratio 3.73 is read. 
Therefore, average range == (3.73) (2.8) = 10.4 square inches. The larg- 
est and smallest rashers are estimated as 26.1 =*= (10.4/2); that is, 31.3 
and 20.9 square inches. Actually, the extreme areas in the sample as 
reported were 32.8 and 21.3 square inches. 

The table allows the estimate not only of the average range but also 
of the ranges that will be exceeded in only 5% and 1% of random samples 
from the normal population. Thus, it may be said that the chance of 
exceeding the range of rasher area (5.01) (2.8) = 14 square inches is only 
5% if samples of 20 are randomly drawn from a normal population with 
<j = 2.8 square inches. 
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EXAMPLE 5.23 — The mean yield of 207 plots of Kherson oats in 1916 (5) was 
74.8 bushels per acre, the standard deviation, 6.2 bushels per acre. Assuming normal 
distribution, show that the estimated range is 34 bushels per acre. Actually, the range 
was from 56.7 to 92.8. 

EXAMPLE 5.24 — The range of percentage dry matter in 48 samples of Grimm 
alfalfa was 7.6%. One would estimate the standard deviation as 1.7%. 

EXAMPLE 5.25 — It was reported that the standard deviation of a sample of 
cuckoos’ eggs was 0.964 mm., the range in length being from 19.07 to 25.02 mm. 
Would you estimate the sample size as about 600 egg S'* 

5.8 — Common sense in computation. I have never found a magic 
for insuring accuracy in computations. Despite checks and double checks, 
errors may creep in. A correction term may have been forgotten or a 
decimal point misplaced. The final safeguard must be good judgment as 
to the credibility of the results. 

Suppose someone has calculated the standard deviation of the heights 
of 200 10-year-old girls. His result is 1.2 inches. He wishes to know if 
this is reasonable. Looking in table 5.5, opposite n = 200, he takes out 
the ratio 5.5, from which he knows that, on the average, the range should 
be given by, range = (5. 5) (1.2) = 6.6 inches. A glance at his records 
reveals the actual range from 45 to 58 inches. Certainly, he has cause to 
suspect his calculated standard deviation. Another approach is this. 
Start from the known range, 58 — 45 = 13 inches. Substituting this 
range together with the ratio read from the table, the equation becomes, 
13/.r = 5.5, from which estimated s = 2.4 inches. This is a warning that 
the calculated s ~ 1.2 inches is probably too small. 

Most people have reasonably accurate knowledge about means and 
ranges in their experimental materials. Table 5.5 enables them to add a 
practical knowledge of standard deviations. It is sufficient to fix in mind 
four approximate ratios — 3, 4, 5, and 6 — corresponding to samples of 
10, 25, 100, and 500 items. If the work is with rats, the investigator is 
familiar with the mean weight of males at weaning, 52 grams, and the 
range which is roughly 90 — 30 = 60 grams. The standard deviation of 
100 such animals would, therefore, approximate 60/5 = 12 grams. 

The same approximate ratios are valuable to a person reading of the 
researches of others. He may see the report that the mean number of 
stamens in 373 late flowers of Ranunculus ficana was 18, with standard 
deviation 3.3. Assuming normal distribution, he estimates the range as 
between five and six times 3.3, say 17 or 18 stamens. He is thereby in- 
formed that the smallest number of stamens in any flower was about 
18 — 18/2, or 9 stamens. Of course, he would not be surprised to learn 
from some other source that the smallest number was actually 7, or per- 
haps 12 stamens. The point is that he can sketch in the outlines of the 
information possessed by the author of the article. 

One can learn to judge of significance from surprisingly meager infor- 
mation. It may be reported that two lots, each consisting of 10 swine, fed 
different rations, produced back fats with iodine numbers averaging 68 
and 73 respectively. If the reader of the report, from his own work in the 
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same field, had the information that iodine numbers range an average of 
10 points in samples of 10, he approximates the standard deviation as 3.3 
iodine numbers. Proceeding as in chapter 4, he estimates the mean differ- 
ence as highly significant. Or, he reads that the mean height of 800 
13-year-old girls exceeds that of an equal number of boys of the same 
age by 1.3 inches. He wishes to know if girls of this age are really the 
taller, or if the difference may be attributed to the vagaries of sampling. 
Knowing that stature, even at this age, has a standard deviation not 
greatly different from 2.5 inches, he estimates the standard error of the 
difference, 2.5\/2/800 — 0.125 inch. It is clear that, at 13 years, taller 
girls is a population characteristic. 

EXAMPLE 5.26 — One thousand butterfat determinations were made on the milk 
from a herd of Holstcins, the lowest being 2.2% and the highest 5.3%. Would you 
estimate the mean and its standard error as 3 75 =*= 0.015 percent? 

EXAMPLE 5 27 — Twenty swine were paired according to litter, sex, weight and 
condition. One member of each of the 10 pairs was fed ration A, the other B. The 
mean difference in gain at the end of 60 days was 5 75 pounds the range being from 
— 7 to 20 pounds. Show that, according to the rough method of estimation which is 
available, the mean difference fails to reach the 5% level of significance. 

EXAMPLE 5.28 — An asparagus grower recorded the production of 25 pairs of 
staminate and pistillate asparagus plants during one season. He found that the differ- 
ence, M — F, ranged from — 10 grams to 90 grams. Estimate x — 40 and s * 25 
grams. Show that the difference is highly significant. 

EXAMPLE 5.29 — Twenty soybeans were planted on each of 100 plates and placed 
in a germinating compartment (3). Suppose the standard deviation of the number of 
beans germinating on each plate had been computed as 6 beans. Show that the calcula- 
tion was in error, assuming a somewhat normal distribution of germinating seeds. 

EXAMPLE 5.30 — The standard deviation of the weights of 1,000 college freshmen 
men was calculated as 16 pounds. Is that a reasonable figure? 

EXAMPLE 5.31 — Sixty-four samples of peanuts were analyzed for nitrogen con- 
tent, the mean and its standard error being 11.04 ^ 0.117%. Would you be sur- 
prised if told that one of the samples contained only 5% nitrogen? 

5.9 — Rates and percentages. The habit of expressing one measure- 
ment as a fraction of another is as common as the use of ratios in enumera- 
tion data (section 1.16). Such quotients are called rates if the numerator 
and denominator have different units of measurement, and ratios or 
percentages if the units are the same. The yield of a crop, a rate, is almost 
always reported in some such fashion as 30 bushels per acre, while the 
dressed weight of a carcass is, say, 80% of the live weight. These rates 
and ratios are summary numbers or averages. In using them the same 
precautions should be observed as those discussed in section 1.16. A 
distinction is this: there is no test for significance of the difference be- 
tween two isolated quotients of measurement data. One can’t test directly 
the significance of the difference between 30 and 25 bushels per acre, or 
between 3.5% and 4% milk. Only if there is available additional infor- 
mation concerning variation can the test be made. The method then may 
be that of either chapters 2 or 4 depending upon the experimental design. 
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One rate interesting to many is the rate of growth of an organism. 
Growth rates are sometimes expressed as changes in weight (or size, etc.) 
per unit time and sometimes as changes in weight per unit time per unit 
weight already attained . The latter is called relative growth rate. As an 
example, the growth rate of a pig between 60 and 120 days of age, kept 
under uniform conditions, may be rather constant at 1.4 pounds per day. 
During the same 60 days the relative growth rate is declining, starting 
(if the initial weight is 70 pounds) at some such amount as 


1 .4 pounds per day 
70 pounds weight 


0.02 pound per day per pound, 


and ending at 

1 .4 pounds per day 

70 + (60)(1.4) pounds »eigh> = °' 009 > >0 ' ,nd ** da >’ P er P° und ' 

But earlier in the life of the pig a different picture may be seen. The 
actual growth rate may increase from day to day in such manner that the 
relative growth rate is constant; that is, each pound may increase by the 
same amount every day. An example of uniform relative growth rate is 
given in section 14.2. One should distinguish carefully between these 
methods of expressing growth rate. 

Another distinction that should always be clearly made is that between 
amounts of change (differences or increments) and rates of change. A 
pasture yielding 2 tons of hay per acre one year may produce 4 tons the 
next. Though the difference is 2 tons, the increase is sometimes expressed 
as 100%. As a rule, the expression of differences as percentages of a 
small base should be avoided. 


EXAMPLE 5 32 — Five dairy farmers delivered milk at a creamery in the following 
amounts* 


Farmer 

Pounds of Milk 

Percentage Butterfat 
in Milk 

A 

150 

4 75 

B 

500 

3 50 

C 

350 

4 00 

D 

400 

3 00 

E 

100 

5.00 


Calculate the average percentage butterfat, 3.71. 


EXAMPLE 5.33 — Those who have some mathematics at their command will 
observe that if the natural logarithm of the weight, log^e, is plotted against time, t, 
the slope of the curve at any point, 



1 dw 
w dt 


gives the relative growth rate. If common logarithms are used, the slope must be 
multiplied by log # 10 « 2.3026 to get the relative growth rate. 
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Chapter 6 

LINEAR REGRESSION 


6.1 — Introduction. In foregoing chapters the problems considered 
have involved primarily only a single measurement on each individual. 
This measurement may be of vitamin content or weight in which the vari- 
ate is continuous, or of color or opinion, the variate being discrete. The 
observations are compared either singly or in groups, but generally it 
has been one attribute or one kind of measurement per individual. Now 
attention is turned to inferences based on two or more observations on 
each member of the sample. For example, one may make more adequate 
judgments about the 120-day weight increase of a pig if he knows the 
related variates, initial weight, age, and feed eaten; increasing farm in- 
come during some periods may be explained partly by changes in the 
quantity of meat produced; or decisions about the comparative yielding 
abilities of varieties of corn may hinge on the numbers of plants in the 
several experimental plots. The widespread interest in correlated meas- 
urements arises from their numerous applications, only a few of which 
can be discussed in this chapter. 

In the parlance of biological and mathematical statistics, Gabon’s 
descriptive term regression has largely replaced the technical word junction 
used in mathematics. Growth curves and economic time series are re- 
ferred to as regressions of height (or income) on age (or time). Further comment 
on the origin of this usage will be found in section 6.13. 

6.2 — Regression of height on age. Twenty soybeans were planted in 
separate pots, and these were grouped on the greenhouse bench in 4 
blocks of 5 pots each. A week after the plants emerged measurement 
was made of the heights of 4 of them, one chosen at random from each of 


TABLE 6.1 

Height (Cfntimetfrs) of Groups of Soybean Plants, One Group Measured in 
Each of 5 Successive Weeks After Emergence 


Week After 
Emergence 


1 

2 

3 

4 

5 



Block Number 



1 

2 

3 

4 

Average Height 

4 

3 

6 

7 

5 

18 

19 

18 

13 

17 

26 

25 

24 

21 

24 

38 

35 

28 

31 

33 

44 

43 

39 

38 

41 
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the blocks. Measurements on similar groups of four were made in each 
succeeding week, no plant being taken twice. The heights of the 20 plants 
are recorded in t||)le 6.1, &nd the pairs of values of the two variates, age 
and height, are plotted in figure 6.1. 



12 3 4 5 

WEEK AFTER EMERGENCE 

IiG 6 1 — Height of soybean plants m 5 successive weeks after emergence. 


It is clear from both tabic and figure that growth was not uniform 
During the several weeks the increases in average height were 

5, 12, 7, 9, 8 centimeters. 

Two questions present themselves: (i) Does this sample constitute evi- 
dence for uniform growth at this stage in the life of soybean plants or for a 
growth curve different from a straight line? (li) If uniform growth is 
postulated, what is the best way to evaluate the average growth rate? 
The latter question will be attacked first because it is easier. For the time 
being we shall assume that this series of samples may have been drawn 
from a population in which there is a linear relation between age and 
height. That doesn’t mean that every plant in the population is supposed 
to have a specified height at each age. The assumption is that the average 
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height increases uniformly with age. At any particular age the inmp^jhiii * 
heights presumably vary about their mean in ijwfch the same way as did 
the weight gains of section 3.2, and the meafls of the weekly samples fall 
below or above the corresponding population values in the random manner 
characteristic of small sample averages. Question (ii) above may be 
rephrased thus: based on these samples, what is the best estimate of the 
population growth rate, assumed linear? 

One might start by averaging the five weekly increases in height, the 
sum being 41 centimeters, the mean, 8.2 centimeters. But look again at 
that sum. It is clearly the average height attained by the last 4 plants. 

If this fifth group had averaged 45 centimeteis instead of 41, it is easy to 
show that the mean of the 5 increases would have been 45/5 = 9 centi- 
meters instead of 8.2. So this average is completely determined by the 4 
plants measured at the end of 5 weeks. An average dependent upon only 
a small part of the experimental data leaves much to be desired. 

It might next be decided to calculate the growth rate to the end of 
each of the 5 weeks, then to average these. The rates would be 5/1 = 5, 
17/2 = 8.5, 24/3 = 8, 33/4 = 8.25, and 41/5 = 8.2 centimeters per 
week, and their average, 7.59. This would be an appropriate average if 
thf standard deviation of the population of plants each week were propor- 
tional to age of the plants (5). While there is a tendency in the sample 
standard deviations to increase (observe the ranges), the trend is scarcely 
uniform enough to warrant setting up such an hypothesis. This averaging 
of growth rates gives most weight to the observations at the earlier ages. 

Another way of arriving at an estimate of growth rate is known as the 
method of averages. The mean height of all the plants, (5 + 17 + 24 + 33 
+ 41 )/ 5 = 120/5 — 24 centimeters, is divided by the mean age, (1+2 
+ 3 + 4 + 5)/5 = 3 weeks. The rate, 24/3 == 8 centimeters per week, 
would be a proper estimate if the variances of the weekly samples were 
proportional to age. Again, the evidence is rather meager. This method 
lessens the weight ascribed to the heights near the beginning of growth. 

A third method, one that seems more suitable for this sample, is based 
on the hypothesis that the weekly standard deviations in the population 
are all equal. This method will be presented in detail because of its 
greater utility, though it is clear that either of the other hypotheses might 
be acceptable so far as our data arc concerned. The computation for this 
third method, somewhat more elaboi ate than those above, is described in 
the next section. 


6.3 — Calculation of regression passing through the origin. Ho- 
mogeneous variance in the groups. The calculation set out in table 6.2 
involves the products of the pairs of variates, X = number of weeks, 
Y = average height, as well as the square of X. (X is known as the inde- 
pendent variate or abscissa, while Y is the dependent or ordinate.) The 
formula for average growth rate, SXY/SX 2 , shows that a height, F, is 
the more effective in determining growth rate the greater its corresponding 
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TABLE 6.2 

Calculation of Regression Through Origin. Homogeneous Variance 

in the Groups 


Week 

X 

X* 

Average 

Height 

XV 

Estimates 
? = 8. 15 A' 

Deviations 

Y - f - 

1 

1 

5 

5 

8 15 

-3 15 

2 

4 

17 

34 

16 30 

0 70 

3 

9 

24 

72 

24 45 

-0 45 

4 

16 

33 

132 

32 60 

0 40 

5 

25 

41 

205 

40 75 

0 25 

15 

55 

120 

448 




Average growth rate = SXY/SX 1 = 448/55 = 8.15 cm. 'week 


age, A'. In other words, the effect of each Y is augmented by its A r , so that 
heights of later ages are more influential than those earlier. Technically 
speaking, under the present hypothesis the weight, which is inversely 
proportional to the weekly variance, is the same at all ages. 

The rate, SXY/SX 2 , is known as the regression coefficient , ordinarily 
symbolized by b. Under the conditions already specified, it is an unbiased 
estimate of the population regression coefficient, One condition met 
in this example but not hitherto mentioned is that the sample values of X 
must be measured exactly. Errors in measuring A" would bias the estimate, 
b . As heretofore, the term “error” does not refer to mistakes in measure- 
ment, though such would be included. Implied, rather, is the inability 
of the technician to make exact measurements. Practically, the age in 
weeks can be measured without error. But suppose the timepiece varied 
by as much as 24 hours forwards and backwards, then “error” would be 
introduced into the measurement of age, and the estimate of growth 
rate would be thereby biased, the tendency being to underestimate it. 

Yet one more assumption is being made. Though X may be chosen at 
will, there must be no selection of T; it must be taken at random from 
all the T’s corresponding to a given X. In our example, the time for 
measuring heights was arbitrary. It need not even have been spaced at 
equal intervals. But once the date of measurement was fixed, the height 
of the pre-selected plant must be recorded without prejudice. Another 
point of view is this: if the investigator wishes to select values for one of 
his variates, he thereby identifies it with X. The other variate becomes 
the random Y, If, in addition, values of Y were chosen, regression would 
cease to have meaning except perhaps as an evaluation of the whim of the 
chooser. 

Since b is an estimate of the increase in height per week, the average 
population height at any age, X, is estimated £s 

P = bX = 8.1 SX centimeters 
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This is known as the regression equation or the regression of Y on X. The 
symbol, ?, may be read, regression Y or estimated Y or predicted Y. If any 
age, such as 3 weeks, is substituted for X , the corresponding value, 

T = 8.15(3) = 24.45 centimeters, 

estimates the average population height for that age. In the table are 
estimates at each tabular date. Naturally, the sample averages deviate 
somewhat from the regression estimates, the differences, F — F, being 
known as deviations from regression or errors of estimate. These differences 
will be symbolized by d y . x . 

If desired, estimates of height may be interpolated between whole 
weeks. Thus, on the 25th~day X = 3.57 weeks and 

r == 8.15(3.57) = 29.1 centimeters 

One can even extrapolate on the assumption that the same linear growth 
rate will continue. At age 6 weeks, the height is then estimated as 

? = 8.15(6) = 48.9 centimeters 

6.4 — The regression line. This graph of the regression equation is 
the straight line drawn in figure 6.1 . By hypothesis it passes through (0, 0), 
and is constructed by plotting any or all of the points (1, 8.15), (2, 16.30), 
etc. The growth rate, 8.15, is referred to as the slope of the line because 
this line rises 8.15 centimeters with each week. Deviations from regression 
are depicted clearly by vertical distances from the line to the observation 
points, positive deviations indicating points above the line and negative, 
below. These deviations measure the failure of the sample points to con- 
form to the estimated trend. Notice the large negative deviation for the 
first week. One often sees this. Apparently it characterizes the transition 
from use of nutrients stored in the seed to those elaborated by the new 
plant. 

The regression line is a kind of moving average, passing among the 
points that determine it in a manner specified by the particular hypotheses 
set up. For a given A”, such as 3 weeks, the regression value of F, 24.45 
centimeters, is a more accurate estimate of the population height than is 
the sample value, 24 centimeters. One reason is that the regression esti- 
mate is based on 20 observations instead of the weekly 4. Another is that 
the height reached in any one week might be affected by unusual environ- 
mental conditions which tend to be equalized over the 5-week period. 

Let us summarize the procedure outlined above. If you are fitting a 
regression through the origin, first examine the group variances to learn 
if they arc homogeneous. For a test of this hypothesis, see section 10.13. 
If you decide that the population variances may be homogeneous, use 
the method of table 6.2. If you reject that hypothesis, first plot the standard 
deviations against X, then the variances. If the latter seem to yield a 
line approximately straight, use the method of averages; if the former, 
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use the first method given, averaging the weekly slopes, Y/X. Verify the 
facts that X is measured without appreciable error and that the corre- 
sponding F’s are random. 

EXAMPLE 6.1 — Following are measurements on the heights of soybean plants in 
the field (17). 


Age in weeks 

1 2 

Height in centimeters 

5 13 


i 


4 5 6 

23 33 38 


7 

40 


The weekly variance seemed uniform throughout the period. Calculate the regression, 
f = 6.029A', by the method of table 6.2. Graph the data and plot the line. Compute 
the deviations from regression, —1.03, 0.94, —2 09, —1.12, 2.86, 1.83, and —2.20. 

EXAMPLE 6.2 — In the text example, assume the standard deviation in the popu- 
lation to be proportional to age, so that b = 7.59 centimeters per week. Calculate the 
deviations from regression, —2.59, 1.82, 1.23, 2.64, and 3.05. Note the large effect 
of the first week’s height in making small the slope of the regression line. 

EXAMPLE 6.3 — Compute the deviations from regression in the text example if 
the method of averages is used. Ans. —3, 1, 0, 1 and 1. Note the increasing emphasis 
on the later heights: while the first deviation is larger all the later ones are smaller. 

EXAMPLE 6.4 — Dungan (6) studied the effect on yield consequent upon the 
removal of various fractions of the leaves of maize plants in Illinois. In one experiment 
the leaves were stripped off when the grain was in the milk stage, the total leaf area 
being reduced by 16^, 3336, 50, 66 2 6, 8334* and 100% in six lots of plants. The cor- 
responding reductions in the yield of shelled grain were 13, 17, 28, 22, 37, and 50%. 
Assuming a regression line through the origin and variance proportional to age, use 
the method of averages to compute the regression line, F = Q.477A\ Do you suppose 
the values of X were measured without error? 

6.5 — The regression, Y = a + bX. Homogeneous variance. X 
measured without error. Y randomly taken. Regressions passing 
through the origin are somewhat unusual. Ordinarily a more suitable 
equation is of the form, Y = a + bX y where b represents the slope as 
before and a is the ordinate of the point where the line crosses the Y axis 
(X = 0). For comparison we shall fit this more adaptable line to the soy- 
bean data. The computations in table 6.3 lead to the evaluation of the 
two constants, a and b . With the exception of the last column, all the 
calculations follow a pattern which has become familiar in earlier chapters. 
In the last column are products of corresponding x- and ^-deviations, such 
as (— 2)( — 19) = 38. The sum of these products divided by the number 
of degrees of freedom. 


Sxy __ 88 

4 A 

77 — 1 4 

is known as the covariance of X and Y. 

The regression coefficient is given by 

b — Sxy/Sx 2 — 88/10 = 8.8 cms. per week. 
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TABLE 6.3 

Calculation of Linear Regression. X Measured Without Error and 
Variance Homogeneous 
Mean Heights of Soybean Plants 



Age 

Height 

Deviations 

Squares of 

Products of 


(weeks) 

(cms.) 

From Mean 

Deviations 

Deviations 


X 

Y 

X 

y 

X 2 

y 

xy 


1 

5 

-2 

-19 

4 

361 

38 


2 

17 

-1 

- 7 

1 

49 

7 


3 

24 

0 

0 

0 

0 ! 

0 


4 

33 

1 

9 

1 

81 

9 


5 

41 

2 

17 

4 

289 

34 

Sum 

15 

120 

0 

0 

10 

780 

! 

88 

Mean 

3 

24 

1 




? “ y + ^^(A' — x) — 24 -f- — 3) * 24 + 8.8A' — 26.4 =* — 2.4 -f- 8.8A' cms. 


Compare this with the formula of section 6.3 where deviations from means 
were not used. 

No attention is now being paid to the origin of coordinates, but there 
is another point, (. x , ?/), marked by an arrow in figure 6.2, which assumes 


r 



Fig. 6.2 — Regression line fitted to soybean growth data. Line passes through origin 
of deviations, (3, 24), and cuts Y axis 2.4 centimeters below origin of coordinates 
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the same role in this present regression method. It may be called the ori- 
gin of deviations, since both * and y are measured from it in all four direc- 
tions. The regression now passes through this origin. To the right of 
(Xy y) the line rises 8.8 centimeters for each week in excess of x = 3, so 
that its equation may be written, 

y = 8.8* centimeters 

As an example ot the use of regression, we may estimate the population 
deviation from mean height at age, x — 2, (that is, X = 3 + 2 = 5 
weeks) as 

y — 8.8(2) = 17.6 centimeters 

Compare this with the sample deviation, F — y — 41 — 24= 17 centi- 
meters. Note that $ = F — y (figure 6.2). 

Instead of using deviations, we may return to the original measure- 
ments by substituting X — x for * and f' — y for y . Then 

? - y = 8.8(-Y - x) 

or r — 24 = 8.8 (A' — 3) centimeters 

for the soybean heights. According to the convenience of the user, this 
equation may be written as 

f = 24 + 8.8(A r - 3) 

or f = —2.4 + 8. 8 A' centimeters. 

The advantage of this latter form is its brevity. A disadvantage is that 
the constant term, —2.4, is a hodgepodge of two means and a regression 
coefficient with no clear biological meaning. If X = 1 week is now sub- 
stituted in either equation, f is found to be 6.4 centimeters, and the 
deviation from regression, F — T = 5 — 6.4 = — 1.4 centimeters. The 
other values of T and Y — T are recorded in table 6.4, and from them (or 
any two of them) the line in the figure is plotted. 


TABLE 6.4 

Regression Estimates of Soybfan Heights Contrasted With Observed Heights 



Average 

Regression 

Deviation From 

Square of 

Age 

Height 

Estimate 

Regression 

Deviation 


Y 

f 

Y - ? = d v . x 

(Y - 

1 

5 

6 4 

-1 4 

1.96 

2 

17 

15 2 

1 8 

3 24 

3 

24 

24 0 

0 0 

0 00 

4 

33 

32 8 

0 2 

0 04 

5 

41 

41 6 

-0.6 

0 36 

Sum 

120 

120 0 

0.0 

5.60 
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There are now apparent two notable features of this type of regression. 
First, the sum of f being the same as the sum of F, it follows that the sum 
of the deviations from regression is zero. This reminds one of the fact 
that the sum of deviations from the arithmetic mean is zero. Thus, the 
new average, has this distinctive characteristic of y: the regression 
line passes among the sample points in such manner that the sum of the 
deviations therefrom is zero; that is, the sum of the positive deviations 
from points above the line is equal to the sum of the negative from points 
below. 

The second feature is that S(Y — T) 2 = 5.60 is far less than the sum of 
the squares of deviations from mean, Sy 2 = 780. In fact, under our present 
hypotheses (homogeneous variance, X measured without error) 5.60 is 
less than the sum of squares of deviations from any other straight line. 
We are again struck by the analogy between It and y since the latter 
average also is situated among the sample points in such fashion that Sj> 2 
is a minimum. Reference to figure 6.2 will make this clearer. The sum of 
the squares of deviations (780) from the line, y = 24, is less than that from 
any other horizontal line; whereas the sum of squares, 5.60, is less than that 
from any other line (compare example 6.9). 

EXAMPLE 6.5 — On alternate days, random individuals were measured, as follows: 


Individual number 

1 

2 

3 

4 

5 

6 

7 

Day, .V 

2 

4 

6 

8 

10 

12 

14 

Weight, Y 

4 

2 

5 

9 

3 

11 

8 


Compute the regiession, F — 2 -f 0 5A', and the deviations from regression, 1, —2, 
0, 3, —4, 3, and —1 Graph the data and verify your results by measuring the devia- 
tions. 


EXAMPLE 6 6— The following weights of body and comb of 15-day-old White 
Leghorn male chicks are adapted from Snedecor and Breneman (13). 


Chick Number 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

Body weight (grams), X 

83 

72 

('9 

90 

90 

95 

95 

91 

75 

70 

Comb weight (milligrams), Y \ 

56 

42 

18 

84 

56 

107 

90 

68 

31 

48 


Calculate the regression, ? 60 T 2 302 (X — 85). 


EXAMPLE 6 7 — In the preceding example, calculate the sum of the squares of 
the deviations from regression Ans 1,554.8. Which comb weight deviates most from 
regression^ 

EXAMPLE 6.8 Construct the graph of the chick data, plotting body weight along 
the horizontal axis. Insert the regression line Do the deviations from regression verify 
the computed values^ 

EXAMPLE 6.9— Compare S(Y — ?)* = 5.60 (table 6 4) foi the regression f 
= a -f bX with the similar sum of squares, 10 84, for the regression thiough the origin. 
1 o get this latter figure, add the squares of the deviations in the last column of table 6.2. 
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Release from the restriction that the regression contain the origin has resulted in a 
better fit of the line to the points, in the sense that the less the sum of squares the better 
the fit. 

Extension of this comparison to the deviations from the first two lines considered in 
section 10.2 involves the introduction of weights. For the first method, the weights are 
inversely proportional to the squares of X so that the weighted sum of squares of the 
deviations in example 6.2 is 

(— 2.59) 2 (1/1 2 ) + (1.82) 2 (l/2 2 ) -h ... 4- (3.05) 2 (l/5) 2 ) _ B M 
(l/l 2 ) 4- (1/2*) T". . . 4- (1/5 2 ) 

This makes it plain that the deviation, —2.59, gets the most weight while 3.05 gets 
the least. 

In the method of averages the weights are inversely proportional to A, the weighted 
sum of squares of the deviations (example 6.3) being 

( — 3) 2 (1/1) + (1 ) 2 (1 / 2) + . . . 4- (0W5) _ v 
(1/1) + (1/2) -f . . . + (1/5) 

This indicates that among the various sets of hypotheses considered, the pair specifying 
(i) regression containing the origin and (ii) variance proportional to X results in the 
best sample fit. 

6.6 — Regression of injured fruit on crop size. It seems wise to 
interrupt the discussion at this point to introduce a new illustration, one 
in which the regression obviously does not contain the origin of coordinates. 

It is rather generally thought that the intensity of the injury by codling 
moth larvae is greater on apple trees bearing a small crop. Apparently 
the density of the flying moths tends towards uniformity, so that the 
chance of attack for any particular fruit is augmented if there are few 
fruits in the tree. The data in table 6.5 are adapted from the results of 
an experiment (9) containing evidence about this phenomenon. The 12 
trees were all given a calyx spray of lead arsenate followed by five cover 
sprays made up of 3 pounds of manganese arsenate and 1 quart of fish 
oil per 100 gallons. There is a decided tendency, emphasized in figure 6.3, 
for the percentage of wormy fruits to decrease as the number of apples in 
the tree increases. In this particular group of trees, the relation of the 
two variates is even closer than usual. 

The new feature in the calculations is the majority of negative products, 
xy, caused by the tendency of small values of Y to be associated with large 
values of A". The regression coefficient shows that the estimated per- 
centage of wormy apples decreases, as indicated by the minus sign, 1.013 
with each increase of 100 fruits in the crop. The regression line, and of 
course the percentage, falls away from the point, A(x , y ), by 1.013 for each 
unit of crop above 19 hundreds. 

The regression line brings into prominence the deviations from this 
moving average, deviations which measure the failure of crop size to ac- 
count for variation in the intensity of infestation. Trees number 4, 9, and 
11 had notably discrepant percentages of injured fruits, while numbers 2 
and 5 performed as expected. Presumably these are random deviations 
from the average (regression) values, but close observation of the trees 
during the flight of the moths might reveal some characteristics of this 
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phenomenon. Tree 4 might have been on the side from which the flight 
originated or perhaps its shape or situation caused poor applications of 
the spray. Trees 9 and 1 1 might have had some peculiarities of confor- 
mation or foliage that protected them. Careful study of trees 2 and 5 
might throw light on the kind of tree or location that receives normal in- 
festation. This kind of case study usually docs not affect the handling of the 
sample statistics, but it may add greatly to the investigator’s knowledge 
of his experimental material and may afford clues to the improvement of 
future experiments. 

There are two extremes among attitudes toward experimental data, 
both of which should be avoided. Some attend only to minute details of 
sample variation, neglecting the summarization of the data and the con- 
sequent inferences about the population. Others are impatient of the 
data themselves, rushing carelessly toward averages and other generaliza- 
tions. Either course fails to yield full information from the experiment. 
The competent investigator takes time to examine each datum together 
with the individual measured. He attempts to distinguish normal varia- 
tion from aberrant observations. He then judges his summaries and his 
population inferences and draws his conclusions against this vivid back- 
ground of sample facts. 

6.7 — Comparison of individuals and of group means. Adjusted 
% values of Y. You have doubtless been impressed by the fact that when 
there is regression the statement, “One tree had only 27% wormy fruits 
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in an orchard whose mean was 45%/’ may be inadequate. You immedi- 
ately ask, “What was the size of the tree crop?” If you learn that it was 
40 bushels you become aware that 27% was a greater rate of infestation 
than one would expect in a tree with so large a yield. It is not the orchard 
average with which the tree percentage should be compared, but it is 
the expected percentage for the individual crop size. The regression, 
in fact, replaces the mean as a standard of comparison. This new standard 
is not the same for all trees but varies according to the number of apples 
borne. 

In the case studies mentioned in the foregoing section two individuals 
are compared, not directly by their percentages but by contrasting their 
deviations from regression. The rates of infestation of two trees are im- 
mediately comparable only if their yields are the same. Otherwise, it is 
their errors of estimate that must be compared. From table 6.5 it is seen 
that, while trees 1 and 4 differ by 59 — 53 = 6% in actual percentages 
of wormy fruits, yet when they are compared on the basis of crop size the 
difference is 2.86 — 11.04 = —8.18%, meaning that if these trees had 
borne the same number of apples the percentage of injured fruits on 1 is 
estimated to have been less than that on 4 by 8.18%. On the basis of yield 
one would expect these trees to differ by 56.14 — 41.96 = 14.18%, 
whereas they actually differ by only 6% (6 — 14.18 = —8.18%). 

Instead of comparing deviations from regression it is often more con- 
venient first to adjust all values of Y to a common X (usually x ), then 
compare the adjusted percentages. This adjusted Y is y plus the devia- 
tion from regression (figure 6.4). To table 6.5 one could easily append a 



Fio. 6.4 — Regression of ) r on X . NP «* Y and MB =» (adjusted F) = y 4“ AB = 

-f GP y -f (deviation from regression). Also, MB = NE = Y — EP — Y — FG 
■ Y - b(AF) = Y - bx. 

column containing the 12 adjusted percentages, 47.86, 44.83, 47.90, 
56.04, etc. These repiesent the rates of infestation that might have been 
expected if the trees had all borne the mean number of apples, 1,900. 
Differences among these adjusted values of Y are identical with corre- 
sponding differences among the deviations from regression. Thus, you 
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see that the adjusted percentage for tree 4 is greater than that for tree 1 
by 56.04 - 47.86 « 8.18%. 

It isn’t necessary to calculate deviations from regression in order to 
arrive at adjusted percentages. The latter may be got directly by use of 
the formula, 

Adjusted Y = Y — bx, 

where b is the regression coefficient and x is deviation from mean, as usual. 
For example, take tree 1 in the table. Since Y = 59, b ~ —1.013 and 
x = —11, the adjusted percentage is 

59 - ( — 1 .01 3) ( — 11) = 47.86%, 

which checks with the result given above. 

Not only individuals, but also the means of two groups of percentages 
may be compared on the basis of their deviations from regression (or their 
adjusted values). The pertinent hypothesis is that the two groups have the 
same population regression, 0, and that b is its estimate. As an example, 
suppose the first four apple trees to be of variety A and the last four, B . 
Then the difference between their mean deviations from regression is 

(2.86 - 0.17 + 2.90 + 11.04)/4 - (- 6.00 - 2.95 - 7.91 + 3.27)/4 


the sample trees of variety B being less susceptible to attack. 


EXAMPLE 6.10 — Calculate adjusted Y for each of the eight tiees of varieties A 
and B in the paragraph just above, then show that the difference between the mean 
adjusted percentages of A and B is 7.56%. 

EXAMPLE 6.11 — You can get the same answer as in the foregoing example bv 
doing this: calculate the mean Y and mean x in each group, adjust each mean Y, 
then subtract. Can you still further shorten the process^ These short cuts are due to 
the linearity of the regression. 

EXAMPLE 6.12 — Another group of 12 trees, investigated by Hansberry and Rich- 
ardson, was sprayed with lead arsenate throughout the season. In addition, the fourth 
and fifth cover sprays contained 1 % mineral oil emulsion and nicotine sulfate at the 
rate of 1 pint per 100 gallons. The results are shown below. These facts may be 
verified: SX « 240, .ST « 384, Sx 1 = 808, Sy* = 1428, Sxy = — 582, regression co- 
efficient = — 0.7203, Y »= 46.41 — 0.7203Y, Y — Y for the first tree = 16.40%. 


Size of Crop, X 


Apples 

15, 

IS, 

12 , 

26, 

18, 

12, 

8, 

38, 

26, 

19, 

29, 22 

Percentage 
Wormy, Y 

52, 

46, 

38, 

3 7 , 

37, 

37, 

34, 

25, 

22, 

22, 

20, 14 
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EXAMPLE 6.13 — From the data on 98 lots of 6 pigs each the following results 
were computed: 


Variate 

Symbol 

Mean 

Variance 

Covariance 

Pounds of feed 
consumed per lot 

X 

4,669 0 

899,557 


Pounds gain in 
weight per lot 

Y 

1,180 7 

39,482 

178,959 


Verify the regression equation, 9 = 0.19894.Y -j~ 251 8. Do you see why the regression 
coefficient is carried to five significant figures? 


EXAMPLE 6.14 — Among the lots of pigs in the preceding example, lot A, con- 
suming 6,863 pounds of feed, gamed 1,612 pounds; while lot B, with 3,688 pounds of 
feed, gained 996 pounds. Aie the gains of these two lots in accord with expected gains? 
Represent graphically the regression equation, and plot points for A and B . 

EXAMPLE 6.15 — In lot C of the pigs above, X — 6,426 and Y =* 1,380 pounds. 
In lot D, X = 4,846 and Y — 1,362 pounds Compute the errors of estimate for these 
lots. Do you agree that the net difference between lot gains, after feed consumption 
is compensated, is 296.3 pounds -* Plot points corresponding to C and D, and observe 
the graphical interpretation of this difference. Observe that Y is almost the same in 
the two lots 

EXAMPLE 6.16 —The 98 lots of example 6.13 were grouped in various experi- 
ments conducted at various times and under various environmental conditions. In 
one of these experiments containing 17 lots, the average amount of feed per lot was 
4,007 pounds, with average lot gain 1,087 pounds. In a second with 13 lots, the figures 
were 6,565 pounds of feed and 1,546 pounds gain. Verify the fact that the net differ- 
ence in gain, after proper adjustment for feed consumption, averaged 50 pounds per lot. 

EXAMPLE 6.17 — Could you have got the estimated difference in the last example 
by using only the regression coefficient, 0.19894 pound gain per pound feed consumed, 
to reach the correct result? 

6.8 — Standard deviation from regression (or standard error of 
estimate). The deviations from regression are squared and then added 
in each of the tables 6.4 and 6.5. Attention was called to the fact that, 
under each hypothesis set up, this sum of squares is less than any similar 
sum that can be got from other straight lines passing among the datum 
points. In other words, the methods of calculation that we have adopted 
follow from the specification that the sum of the squares of deviations 
from linear regression shall be the minimum. One part of the theory 
provides that these deviations shall be randomly assorted, falling in the 
table without any predetermined pattern. This is another way of saying 
that the standard deviation in the population is the same for all values of 
X and is one distinction between the present method of fitting and others 
such as the first two described in section 6.2. 

For simplicity, the deviation from regression, Y — T, has been denoted 
by d v . x , indicating the deviation of Y from the regression of Y on X. Then 
the mean square of the deviations from regression, 

™JC£. = = 27.388 (applied to the apple data), 

n — 2 12 — 2 
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that is, the variance of the deviations, is an unbiased estimate of the 
population variance from regression. Its square root, \/27.388 = 5.23%, 
the standard deviation from regression, is an average of the deviations 
from regression in the next-to-last column of table 6.5. This is an estimate 
of that population standard deviation of Y which is the same at all values 
of X. It will be observed that the sum of the deviations from regression 
is zero so that there is no correction for mean to be subtracted from 
273.88. The new expression for degrees of freedom, n — 2, results from 
the fact that two averages involving Y have been taken from the sample 
data, first the mean, y> and now the regression coefficient, b. For each of 
them one degree of freedom must be deducted from n. If division were by 
any other number than n — 2, the estimate of variance would be biased. 

The standard deviation from regression is symbolized by s y . Xi denoting 
the variation in Y after its regression on X has been discounted. In the 
apple data, s y . x ~ 5.23% measures the amount of variation (in per- 
centage injured apples) not associated with, explained by, or dependent 
upon changing amounts of tree yield. Interpreting it graphically, this is 
an average of the vertical distances of the dots (figure 6.3) from the regres- 
sion line. 

It isn’t necessary to follow the calculations in the last three columns of 
table 6.5 in order to get Sd v . x 2 . It may be shown (example 6.22) that 

Sd v . 2 = Sy 2 - (Sxy) 2 /Sx 2 = 1,222 - (- 936) 2 /924 - 273.84, 

which is a more nearly exact figure than the 273.88 in the table because 
the latter is worse affected by rounding errors. 

EXAMPLE 6.18 — Applying the formula immediately above to the results in tabic 
6.3, calculate Sd v . z 2 and compare the answer with table 6.4. 

EXAMPLE 6.19 — From the result of the preceding example, calculate s v . x ~ 1.366 
cms. This is an average of what deviations? 

EXAMPLE 6.20 — In the chick-hormone data of example 6.7 you had Sx 2 ■* 1,000, 
Sy? * 6,854, Sxy = 2,302. Calculate s v . x = 13.9 milligrams. 

EXAMPLE 6.21 — From the soybean heights in table 6.1 there are available two 
independent estimates of the population variance, assumed constant at all the sample 
ages. The first is got by averaging 5 variances, one calculated from the heights of the 
four plants measured each week. The variance for the first week is r 2 = 3.33. Compute 
the other four, together with the mean of all five, 8.67. The second estimate is made 
from the variance-from-rcgiession of the 5 weekly means (example 6.19). This is 4r„.* 2 
*= 4(5. 6/3) = 7.47. Multiplication is by 4 because r v . x 2 is the variance of means, only 
one-fourth of the variance of plants. In chapters 10 and 11 will be found more ade- 
quate methods for treating these soybean data. 

EXAMPLE 6.22 — 1 hose interested in algebra will profit by deriving the formula, 
Sd v . v 2 = Sy 2 — (Sxy) 2 /Sx 2 . The easy way is to work with deviations from mean, 

dy* “ y - V * y ~ hx 

Square both sides of the equation, then sum, remembering that b ~ Sxy/Sx 2 . 

6.9 — Fiducial statements and tests of null hypotheses. Having in 
hand an unbiased estimate of a y . x 2 , the variance from regression in the 
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population, we are able to proceed with probability statements about 
the various averages of regression. 

First, there is the regression coefficient, &n estimate of the popula- 
tion coefficient, 0. In random sampling from this population, b is dis- 
tributed with a variance estimated by 

= Srl/S# 

Thus, in the apple data, 

s h 2 = 27.388/924 = 0.0296, = 0.172% 

Moreover, the quantity ( b — 0)/s b follows the /-distribution with n — 2 
degrees of freedom. Hence, fiducial limits may be set, 

/i = b — t.obSb and k ~ b + t os s bl (t M = 2.228, d.f. = 10) 

which, for the apples, are — 1.013 — 0.383 = —1.396 and — 1.013 
+ 0.383 = —0.630 percent per 100 fruits. If we say that the population 
regression is within these limits we shall be right unless our sample is one 
of the divergent kind that occurs about once in 20 trials. But perhaps 
it is our interest to test the hypothesis that £ = 0 (or some other value that 
may seem reasonable). Then 

t = ? = = 5.89 for the apples. 

s b 0.172 

Certainly this would be a remarkably aberrant result if the sampling were 
from the hypothetical population in which ft = 0. One would probably 
reject that hypothesis, then conclude that in the population there is a 
regression of percentage wormy apples on crop size, the value likely being 
between —0.630 and —1.396 percent per 100 fruits. 

Next consider the mean. In unrestricted sampling its variance is, 
as usual, s 2 /n. But in random sampling of both X and Y , as well as in 
sampling from a population confined to the same values of X as occurred in 
the sample under consideration (these values of X may be chosen in ad- 
vance of the sampling if desired, as they were in the soybean experiment) 
the opportunity for variation is less, the variance being s y . x 2 /n = 27.388 
/1 2 = 2.282 for the apples. Thus, y/2.282 = 1.51% is the standard 
error of the mean, 45%, under the circumstances stated." Fiducial limits 
may be set: h = 45 - (2.228) (1.51) = 41.6% and k = 45 + (2.228) 
(1.51) = 48.4%. Furthermore, any relevant null hypothesis may be 
tested. Clearly m = 0 is fantastic, but it might have been known that over 
a large fruit growing region in the same year the damage to apples was 
50%. To get evidence about the comparative rate of injury in this 
orchard, one might set up the hypothesis that this sample of apples was 
drawn from a population in which m = 50. Then 

t = (45 — 50)/ 1.51 = 3.31 (in absolute value) 

Since this value of t is outside the 1% level, 3.169, the mean of 45% would 
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be unusually divergent if the sampling were from the hypothetical m — 50% 
with trees having crop sizes from 800 to 4,000 apples. It might be con- 
cluded that (i) the spraying done was more effective than the usual prac- 
tice, or (ii) that this orchard was, for some unknown reason, less heavily 
attacked than others, or (iii) that it had more than the average number 
of apples per tree. 

A third set of statements may be made about any F which may be of 
interest. For the average crop size, 1,900 apples, the mean percent injury 
is estimated as F = y, the case just discussed. For any other X the esti- 
mate, F, is subject to an additional source of variation. Not only may 
the mean, y, be in error but also the regression, b. Since F = y + bx, its 
variance is the sum of the two independent variances of y and of bx. 
This is 


*r 2 



Hence, sf = s v . x \/l/n + x 2 /Sx 2 

It is clear that this standard error increases with x, that is, as X recedes 
from x. The reason for this is rather obvious in a graphical representation 
of regression. Any error in the slope of the line has a more pronounced 
effect on F the further removed it is from the center of the distribution. 
In the apple data, 


sy = \/27. 388(1/ 12 + x 2 /924) - V2.282 + 0.02964* 2 


Thus, for trees bearing crops like number 12, x = 21 and s? = 3 92%, 
notably greater than s$ = 1.51%. In ordinary practice the second term 
is relatively small within the sample range, but in forecasting beyond this 
range the term with x 2 may become, as above, predominant. If we now 
say that for trees bearing 4,000 fruits the population mean percentage of 
injured is between 23.73 — (2.228) (3.92) = 15.00% and 23.73 + (2.228) 
(3.92) = 32.46%, the probability of being right is 95%. 

One often wishes to make a fiducial statement about injury in a specific 
tree, F, rather than about the corresponding population mean, F. For 
that kind of statement, the tree variance, s v . x \ must be added, bringing 
the total to 

s v . x \\ + 1 In + x 2 /Sx 2 ) 

This implies that while the population mean percentage for trees with 
4,000 apples is probably between 15.00% and 32.46%, yet a randomly 
selected tree with that yield may be found to have anywhere between 
9,16% and 38.30% injured fruits You see, predicting the performance of 
individuals, especially of those in the outer fringe, is a hazardous venture 
unless s v . x is unusually small. 

For purposes of illustration we have made all the probability state- 
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merits above as though they were independent of each other. Statements 
about y and b are independent, but it is clear that probabilities related to 
Y and Y are determined in part by the sam£ accidents of sampling that 
produced y and b. It is proper to consider the implications of all these 
statements. Only remember that if it should be found, for example, that 
the population mean lies beyond the sample based fiducial limits, then the 
probability would be much greater than .05 that the population value of 
regression Y is outside the fiducial limits based on Y. 

EXAMPLE 6.23 — In the regression of comb weight on body weight, example 6.7, 
n ** 10, £ * 83 grams, y « 60 mg. Sx 1 « 1,000, Sy* = 6,854 and Sxy — 2,302. 
Set 5% fiducial limits on y t assuming the same set of body weights. Ans. 49.8 — 70.2 
mg. 

EXAMPLE 6.24 —In the chick data, b *= 2.302. Test the hypothesis that /3 ** 0. 
Ans t “* 5.22, P < .01. 

EXAMPLE 6.25 — Since evidently there is a population regression of comb weight 
on body weight, set 95% fiducial limits to the regression coefficient. Ans. 1.28 — 3.32 
mg per gram. 

EXAMPLE 6.26 — Predict the population average comb weight of 100-gram chicks. 
Ans. 99.1 mg. with 95% limits, 79.0 — 119.2 mg. 

EXAMPLE 6.27—Set 95% fiducial limits to the forecast of the comb weight of a 
randomly chosen 100-gram chick. Ans. 61.3 — 136.9 mg. 

6.10 — Short-cut methods of computation in regression. So far in 

this chapter you have not been greatly burdened with computation. In 
practice, regression calculations are rather tedious so that either machines 
are used or some form of coding is practiced. Short cuts in regression are 
mainly those already explained in chapter 5, only a single new feature 
being introduced. The data in table 6.6 will serve as illustration. 

TABLE 6.6 

Initial Weights and Gains in Weight (Grams') of 15 Female Rats on High 
Pro i i:in Dii - r, 28tii to 84th Days of Age 


i 

Rat Number 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

Initial 
weight, .V 

50 

6-4 

76 

64 

74 

60 

69 

68 

56 

48 

57 

59 

46 

45 

65 

Gain, Y \ 

| 128 159 

158 

119 

133 

112 

96 

126 

132 

118 

107 

106 

82 

103 

104 


If a machine is available the methods of section 5.2 are easily extended 
to the calculation of regression. In most machines the sum of cither 
variate is run up along with the sum of its squares; that is, both SX and SX 2 
appear in the recording dials at the end of a single series of operations. 
As for the sum of products, 

(50)(128) + (64) (159) + . . . + (65)(104) = 108,530, 
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it is also read from the dials after the sequence of multiplications, none 
of the individual products being recorded. Your book of instructions, or 
the nearest agent, will furnish directions. In some machines the two sums, 
the two sums of squares, and the sum of products may all be accumulated 
at one run. If there are available machines for tabulating data punched 
in cards, they will be found convenient for handling large samples (100 or 
more sets of observations) (1). The results from any of these machines 
are entered in a tabic like 6.7. One advantage of a machine is the avoid- 

TABLE 6.7 

Computations in Regression if Machine Is Used. Rat Data From Table 6.6 


SX « 

901 

SY 

= 

1783 

n = 

15 

£ - 

60.07 grams 

y 

sx 

118.87 grams 



SX 2 • 

55,465 

SY 2 

cs 

218,297 

SXY *» 

108,530 

{sxy/n - 

54,120.07 

(SY)Vn 

sx 

211,939.27 

(SX)(SY)/n - 

107,098.87 

.Sx* * 

1,344.93 

Sf 

- 

6,357.73 

Sxy “ 

1,431.13 


ance of much copying of numbers. This enhances both the accuracy and 
the rapidity of calculation. The only entries that need be made are those 
in table 6.7. Note the one new feature, the correction term for the sum 
of products. This contains the product of SX by SY instead of the square 
of either, rather a reasonable method of correcting products. Don’t 
overlook the fact that the correction term may be larger than SAT, 
making Sxy negative. This would indicate a downward sloping regression 
line. 

The six quantities, 

n, x, y, Sx 2 , Sf, Sxy, 

constitute the entire information necessary for completing the computa- 
tions of regression. We have, for example, 

b = Sxy/Sx* = 1,431.13/1,344.93 = 1.0641 

Gain in weight among such female rats is, therefore, partly predictable, 
each gram increase in initial weight corresponding to 1.0641 grams 
increase in gain. However, 

Si,.,* = Sf - (SxyY/Sx? « 6,357.73 - (1,431.13)71,344.93 = 4,834.88, 
j ,.* 2 = Sd,.,*/(n - 2) = 4,834.88/13 = 371.91, 
r» 2 = s v . z 2 /Sx 2 = 371.91/1,344.93 = 0.2765 
s> = -s/o.2765 = 0.526 
t = b/st = 1.0641/0.526 = 2.02 

The large value of s b relative to b will be obvious if you graph the data in 
the manner of table 6.2. Since t does not quite reach the 5% level one 
must not be too confident about population facts. Nevertheless, the sam- 
ple estimate cannot be ignored. There is a good deal of evidence to sup- 
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port some relation between initial weight and gain. We have here some 
information about experimental design and shall return to these results 
in the succeeding section. 

The student is now prepared to read chapters 10 and 77, analysts of variance , or he may dip 
into chapters 7 and 9. 


If a calculating machine is not available, one may profit by following 
directions for coding given in chapter 5. Each variate should be reduced 
to a series of integers centering on zero and having a range between 20 
and 50 units. This not only makes the calculation relatively easy but also 
produces results sufficiently accurate for ordinary purposes. The process 
is illustrated in table 6.8. These data were taken from a study of the forces 

TABLE 6.8 

Draft and Speed of Plows Drawn by Tractors 
Illustration of coding in computation of regression 


Speed 
Miles per 
Hour 

V 

Draft 

Pounds 

D 

Code Nu mbers 
(10F - 30)/2 (D/10) - 50 

(rounded) (rounded) 

X Y 

Squares of 
Code 
Numbers 

X s Y 2 

Products 
of Code 
Numbers 
XY 

0 9 

425 

-10 

- 8 

100 

64 

80 

1 3 

420 

- 8 

- 8 

64 

64 

64 

2 0 

480 

[ - 5 

- 2 

25 

4 

10 

2 7 

495 

- 2 

0 

4 

0 

0 

3 4 

540 

2 

4 

4 

16 

8 

3 4 

530 

2 

3 

4 

9 

6 

4 1 

590 

6 

9 

36 

81 

54 

5 2 

610 

11 

11 

121 

121 

121 

5 5 

690 

12 

19 

144 

361 

228 

6 0 

680 

15 

18 

225 

324 

270 

34 5 

5460 

48 

-25 

23 

64 

-18 

46 

727 

1044 

841 


« 23 

£ = 2.3 code units 
_ (2) (2.3) + 30 

io 

** 3.46 mi. /hr. 
SX* « 727 
{sxy/n « 52.9 

Sx 2 « 674.1 
Sv* » (674.1 ) (2/10) 9 
« 26.964 


SY - 46 

= 4.6 code units 
- (4.6 + 50) (10) 
*= 546 pounds 
SY* - 1,044 
(SY)*/n - 211.6 

Sy* =* 832.4 
&P - (832.4)(10) 2 
= 83,240 


SXY - 841 
(SX)(SY)/n - 105.8 


Sxy « 735.2 

Svd « (735.2) (2/1 0) (10) 


1,470.4 
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necessary to draw plows at the speeds commonly attained by tractors (2). 
The character of the soil and depth of the furrow were experimentally 
controlled as closely as possible. Within the limits of the trial the draft 
was roughly proportional to the speed, as a graphical representation of 
the data will indicate. 

In order to avoid large numbers and decimals the coding was carried 
to an extreme. Doubtless some will prefer the decimals and large numbers: 
an argument against coding is the opportunity for mistakes in the pro- 
cedure itself. However, familiarity makes the process less forbidding, 
and there are data whose coding is certainly convenient. 

The decoding offers some novel features. The sums of squares must 
be decoded by use of the squares of the coding ratios and the sum of prod- 
ucts by means of their product. If this doesn’t seem reasonable, try 
extracting the square roots of the sums of squares, decoding in the usual 
manner, then squaring the results. 

The regression coefficient, b = Svd/Sv 2 = 1,470.4/26.964 = 54,53, 
shows that the drawbar pull increased an average of 54.53 pounds for 
each mile per hour increase in speed. The formula for estimating draft at 
various speeds is 

t> = 546 + 54.53 (V - 3.46) = 54.53 V + 357.3 pounds 

Since the use of this regression formula may lead to the prediction of 
drawbar pulls in individual trials, it is pertinent to calculate the variance 
So 2 , of a randomly selected D: 

S(D - D) 2 = S(P - (Svd) 2 / Sir = 83,240 - (1,470.4)726.964 = 3,056 
W - S{D - Z))V (n - 2) •= 3,056/8 - 382.0 
J/, 2 = s d .?( 1 + 1 In + v~/Sv 2 ) =* 382.0(1 + 0.1 + r/26.964) 

= 420.2 + 14.167* 2 

As an example, let us estimate the drawbar pull of a tractor making 
5 miles per hour. 

1) = 54.53(5) + 357.3 - 630 pounds 
v = 5 — 3.46 == 1 54 miles per hour 
Si? = 420.2 + 14.167(1.54)* = 453.8 
Standard error = 21.3 pounds, 1 05 = 2.306 

Fiducial limits : /1 = 630 — (21. 3) (2. 306) = 581, k = 679 pounds 


EXAMPLE 6.28 — Speed records attained in the Indianapolis Memorial Day 
automobile races, 1911-1939 (miles per hour): 


Year 

1911 1912 1913 1914 1915 

1916 

1917 1918 

1919 1920 

Coded year, X 

1 2 3 4 5 

6 

7 8 

9 10 

Speed, Y 

74.7 78 7 75 9 82.5 89 

8 83 

3 (no races) 

88.1 88.5 
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Year 

1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 

Coded year, A' 

11 12 13 14 15 16 17 18 19 20 

Speed, Y 

89.6 94.5 91.0 98.21011 95.9 97.5 99.5 97.6100.4 


Year 

1931 

1932 

1933 

1934 1935 1936 

1937 

1938 1939 


Coded year, X 

21 

22 

23 

24 25 26 

27 

28 29 


Speed, Y 

96 6 

104 

1 104 

1 104 9 106 2 109. 

1 113.6 

117.2 115.0 



Note that n * 27. Calculate* - 15.56, y - 96.21, - 1,908.67, Ay* * 3,398.60, 

Axy « 2,454.69. 

EXAMPLE 6.29 — Estimate the population mean speed for the year 1917. Ans. 
85.2 miles per hour with 95% limits, 83.44 — 86.96 miles per hour. 

EXAMPLE 6.30— Forecast the 1941 speed. Ans. 116.07 miles per hour. Remem- 
bering that an individual speed is being forecast, not the population mean, calculate 
the 95% fiducial limits. Ans. 109.16 — 122.98 miles per hour. Actually, the 1941 
speed, in the last pre-war race, was 115.12 miles per hour. 

EXAMPLE 6.31 — On the graph of speed against date, one can draw the curves 
representing the fiducial limits for each year’s speed. These will be closest to the re- 
gression line at the mean year (coded * = 15.56, corresponding to 1910 *T 15.56 
— 1925.56), and will deviate further from the regression (measured vertically) as X 
departs from its mean cither way. T he curves are the two branches of an hyperbola. 
If t 06 is selected, about 95% of the records should fall between the curves if the sampling 
is random. 

EXAMPLE 6.32 — It was desired to forecast the yields of 29 double and three-way 
crosses of maize under heavy second brood chinch bug infestation (10) The basis of 
prognostication was the known mean yields of top crosses of the parental lines grown 
under the same conditions. A formula designed by Jenkins (11) was used. The fore- 
casted yields, A', and the actual experimental yields, Y, read from a scatter diagram, 
were as follows: 20, 20; 24, 29; 25, 34; 26, 35; 29, 43; 31, 11; 33, 24; 34, 35; 34, 45; 
35, 34; 36, 34; 36, 36; 37, 35, 37, 46; 37, 42; 41, 41; 42, 43; 43, 43; 43, 55; 50, 53; 
50, 55, 51, 54; 51, 55; 52, 42; 55, 55; 56, 54; 56, 51; 57, 50; 71, 61 bushels per acre. 
Construct the scatter diagram. Do you think the regression is linear? Assuming that 
it is, calculate the regression of Y on A r , ? == 0.79 A' -j- 9.4. 

EXAMPLE 6.33 — In the course of his investigations of winter hardiness in apple 
trees, Stark (14) wished to determine the specific heat, T, of the twigs. The laboratory 
technique was difficult and inaccurate. Since more than half the weight of the twigs 
was water, and since the heat capacity of w r ater is great, he tried estimating 1" from the 
percentage of water, X , in the twigs, the latter being easily and accurately measured. 
Twenty-one pairs of Stark’s measurements, X and T, are as follows* 49.4, 0.646, 

50.1, 0.644; 50.8, 0.665; 51.2, 0.670; 51.5, 0.666; 51.9, 0.653; 52.5, 0.6b9; 52.7, 0.657; 

53.1, 0.689, 53.6, 0.669; 55.7, 0.685; 56.3, 0.696; 57.0, 0.700; 58.0, 0.690; 58.5, 0.711, 

59.2, 0.704; 59.7, 0.696; 61.3, 0.713; 62.0, 0.719; 63.1, 0.731; 64.9, 0.731. Compute 
* - 55.83%, Sx* - 431.31, y - 0.6859, Sj* « 0.0142, Sxy - 2.3536, f » 0.005457* 
T 0.3812. Since s v » 0.027 and s u . t = 0.0086, about a third of the variation of Y 
is unassociated with X . Since A' is determined with very little error, Stark concluded 
that Y could be determined with sufficient accuracy by first measuring the percentage 
of water in the apple twigs, then estimating specific heat from the regression equation. 

EXAMPLE 6.34— Derive the formula, Sxy = SXY— (SX)(SY)/n. Hint: Start 
with X ■* x -f * and Y *■ y -f y. Multiply and add, remembering that Sx »= Sy ** 0. 
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6.11 — Use of regression in experimental design. Statistical control. 

As anticipated in section 3.10, regression is one means of reducing the 
unexplained error encountered in experimentation. Variation not con- 
trolled in the conduct of the experiment is often associated with some 
measurable variate, and the measurement of this variate enables the 
investigator to use regression for increasing the information available. 
An example is found in the rat gains of table 6.6. Though it was known 
that gain is usually associated with initial weight, it was not feasible to 
control this extraneous variate; that is, to use only rats all having the same 
starting weight. It was easy, however, to measure the variate, X> and to 
calculate the regression of gain upon it. What advantage was attained? 
Without this precaution experimental error would have been based on 
the rat variance, 

j* « Sf/{n - 1) = 6,357.73/14 - 454.12 
But utilization of known initial weights reduced the error variance to 

Sy.* = 371.91 

Following the argument of section 3.10, it may be said that the informa- 
tion per rat has been increased in the ratio, 454.12/371.91 = 122%. 
Another way of saying it is that, for the same reliability, 22% more rats 
would have been required if the facts about initial weight had not been 
utilized. The relatively inexpensive measurement and use of initial weight 
increased the information from this experiment by 22%. 

Even if experimental control is feasible it is not always desirable. 
Thus, in the experiment with tractor-drawn plows one would certainly 
not wish to confine his trials to a single speed. On the other hand, he 
would get little information if he considered 

x 2 = s<P/(n - 1) « 83,240/9 - 9,249 

as the experimental variance. This experiment having been designed for 
use of regression, the appropriate estimate of random experimental 
variance is 

W 2 = 382.0 

Not only is the experiment founded on a broader basis of experience, but 
the information has been increased in the ratio of 9,249/382.0 = 2421%; 
that is, the information per trial is 2321% greater than it would have been 
if no use had been made of the speed records. 

It appears, then, that in many situations where it is either not con- 
venient or not desired to control experimentally a variate associated with 
the one under investigation, statistical control (15) may be introduced. 
By measuring the auxiliary quantity and using regression one may accom- 
plish the same end as by keeping this quantity constant in the experiment. 
The error, is presumably no greater than s v would be in an experi- 
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mentally controlled sample with constant X. Indeed, it seems that in 
many cases statistical control is more to be desired; the actual situation is 
studied instead of one artificially produced, the observations are extended 
over a greater range, thus broadening the foundation for inference, and 
in the end one has knowledge of the variation of two quantities instead of 
one, together with the relation between them. In each line of research 
it must be learned by experience which type of control is more readily 
available, and which produces information more quickly and cheaply. 


EXAMPLE 6.35 — The chick data of example 6.7 were adapted from the original 
records of 1 3 birds whose body weights (X grains) and comb weights (F mg.) were 
as follows; 83, 54; 72, 42; 69, 29; 78, 37; 69, 18; 90, 84; 90, 56; 95, 107; 95, 90; 91, 68; 
75, 31; 70, 48; 76, 41. From these records, n = 13, £ = 81.0 grams, y * 54.2 mg., 
Sx * — 1,218.00, =* 8,292.31, Sxy « 2834.00. If decisions are to be based on comb 

weight, estimate the increase in information due to use of body weight as a statistical 
control. Ans. 348%. 

EXAMPLE 6.36 — From the apple data of table 6.5, estimate the decrease in pre- 
cision (information) that would result from omitting crop size as a statistical control. 
Ans. 75.3%. 


EXAMPLE 6.37 — For 1594 farms in Crop Estimates District 9 of eastern South 
Dakota (1942), X -* acres in farm and F *= acres in corn were recorded. The variance 
of corn acres was s v 2 = 1,859, while that of deviations from regression was ** 1,002. 
If you collected a random sample of 100 farms from which to estimate the average 
number of acres in corn, s„ ? would be 18.59. If you used farm size as a statistical con- 
trol, how many farms would be required in the sample to get the same amount of in- 
formation? Ans. 54 farms. 


6.12 — Partitioning the sum of squares of the dependent variate. 

Regression computations may be looked upon as a process of partitioning 
SY 2 into 3 parts which are both useful and meaningful. You have become 
accustomed to dividing SY 2 into (SY) 2 /n and the remainder, SjP; then 
subdividing Sy 2 into (Sxy) 2 /Sx 2 and Sd v . x 2 . This means that you have di- 
vided SY 2 into three portions: 

SY 2 = (, SY) 2 /n + (Sxy) 2 /Sx 2 + Sd v . 2 

Each of these portions can be associated exactly with the sum of squares 
of a segment of the ordinates, Y. To illustrate this a simple set of data 
has been set up in table 6.9 and graphed in figure 6.5. v 


TABLE 6.9 

Data Set Up to Illustrate the Partition of SY 2 


X | 2 

4 

6 

8 10 

12 

14 1 

SX » 56 

Y J 4 

2 

5 

9 3 

11 

8 

SY - 42 

n « 7, 

CO 

1 

V - 6, 

Sx* « 112, 

sy - 68, 

Sxy «* 56 
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Fig. 6.5 — Graph of data in table 6.9. The ordinate at X = 12 is shown divided 
into 2 parts, y = 6 and y = 5. Then is subdivided into # =» 2 and «= 3. Thus 
Y~'g+Q+d us ~>6+2+ 3-11. 


In the figure the sixth ordinate is shown partitioned into 3 segments: 

= y + y ~h d y . x 

Each of the other ordinates may be divided similarly, though negative 
segments make the geometry less obvious. The lengths are all set out in 
table 6.10 and the several segments are emphasized in figure 6.6. Observe 

TABLE 6 10 

Lengths of Ordinates in Table 6.9 Togfiher With Segments into Which They 

Are Partitioned 


Pair Number 

Ordinate 

Y 

Mean 

y 

Deviation 

y 

Deviation Trom 
Repression 

dy.r 

1 ! 

4 

6 

— 3 

1 

2 

2 

6 

— 2 

__ 2 

3 

5 

6 

- ] 

0 

4 

9 

6 

0 

3 

5 

3 

6 

1 

— 4 

6 

11 

6 

2 

3 

7 

8 

6 

3 

-1 


42 

42 

0 

0 

Sum of squares 

320 

252 

28 

40 
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Fig. 6.6(a) — Corresponding to the mean, these are the segments subtracted 
from Y leaving the deviations from mean, y, shown in figure 6 0(b). The sum of the 
squares of these 7 segments, 7 ( 6 ' 2 ) = 252, corresponds to the 1 degree of freedom for 
the correction term, (SY)*/n *= ( 42) 2 /7 = 252. 

Fig. 6.6(b) — These are deviations from mean, y, further subdivided in figuies 
6.6(c) and (d). 

Fig. 6.6(c) — Heavy lines show the deviations of regression values, ?, from their 
mean, y. They are completely determined by y , b and a fixed set of *’s. The sum of 
their squares, Sff* *■ 28, is the portion of SY 2 attributable to regression. 

Fig. 6.6(d) — Random deviations from regression, d trx . Each d v . x added to the 
corresponding y and $ of figures 6.6(a) and (c) comprises an ordinate, Y, in figure 6.5. 
The sum of the squares of these deviations, Sd„. z 2 40, corresponding to 5 degrees of 
freedom, furnishes an estimate, s y . x 2 = 40/5 » 8, of a 2 . 
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that in each line of the table (including the two at the bottom) the sum of 
the last three numbers is equal to the number in column Y. 

It is notable that, though 

Y = y + # + dyx, 

yet the sum of squares is 

SY 2 = Sy* + Sf + Sd y . x \ 

each of the three product terms being zero (example 6.42). The sums of 
squares of the ordinates, SY 2 = 320, and of the deviations from regression, 
Sdy. z 2 — 40, are already familiar. It remains only to identify ( SY) 2 /n 
with Sy 2 and (. Sxy) 2 /Sx 2 with S$ 2 . 

First, 

(SYY/n = ntf = Sy* 

n 2 

That is, the correction for mean is simply the sum of the squares of the 
mean taken n times. The reason it is not so used in calculation is that 
any rounding inaccuracy in y would be exaggerated both by squaring 
and by multiplication with n. 

Second, 

= Sbh* = Sy* 

Sx* (Sx*y 

So the sum of squares attributable to regression turns out to be the sum of 
the squares of the n deviations of f' from its mean, y. Certainly, it would 
be a tedious job to calculate Si) 2 every time one wished the sum of squares 
due to regression. You are now in a position to evaluate the short cut, 
(■ Sxy) 2 /Sx 2 . 

I hope you appreciate the beauty of these regression computations. 
Make each formula meaningful, not merely a jumble of letters and figures. 
Associate each with a column in the table and with a line segment in the 
figure as well as with the descriptive phrase which has been used. Famili- 
arity breeds not only utility but admiration. 

The partition of SY 2 is intimately associated with the division of the 
total available degrees of freedom into 3 parts, one part corresponding 
to each portion of .ST 2 (table 6.11). This identification of the several 
sums of squares with corresponding degrees of freedom is one of the useful 
contributions of Professor R. A. Fisher (7). 

It will be observed that zero is the sum of the deviations from the 
horizontal line of figure 6.6 (a) and (b) as well as from the regression line 
of figure 6.6 (d). In fact, this would be true of any line through the origin 
of deviations, ( x , y). The sum of the squares of the deviations in figure 
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TABLE 6.11 

Partition of Degrees of Freedom Corresponding to 3 Sums of Squares 


Description 

Symbol 

Degrees of 
Freedom 

Sum of 

Squares Variance 

The mean 

y 

1 

Stf - (SY)*/n 

Deviations determined by b 
and a fixed set of *’s 

Q ** bx 

1 

sp ** (SxyY/Sx* 

Random deviations from 
regression 

d u . x 

n - 2 

Sdy.J Sy.f 

Total 

Y 

n 

SY* 


6.6 (b) is less than the corresponding sum from any other horizontal line. 
But if you begin to rotate this line counterclockwise about the center, 
(x, #), the sum of squares becomes smaller and smaller till the position 
of the regression line is reached. It is the introduction of a second variate, 
such as crop size in the apple data, that gives meaning to this rotation. 
So long as there is only injured apples to consider, with no hint as to 
differences in crop size, the horizontal line is the only one interpre table. 
In fact, all the points might as well be plotted on some vertical line, such 
as OY of figure 6.3. In that event, the deviations from the origin, y , would 
be exactly those considered in the earlier chapters. But added information 
about crop size warrants the extension of our ideas to include regression. 
Much of the variation in percentage injured fruits is eventually explained 
by differences in crop size. The only remaining variation is that asso- 

EXAMPLE 6.38 — Dawes (4) determined the “density” of the melanin content 
of the skin of 24 male frogs together with their weights. Since “Some of the 24 males . . . 
were selected for extreme duskiness or pallor so as to provide a measure of the extent 
of variability,” that is, since selection was exercised on density this variate cannot be 
estimated (section 6.3) from the sample. Assuming random weights, they may be taken 
as Y. The data follow: 


Density, X 

0 

13 

0 

15 

0 

28 

0 

58 

0 

68 

0 

31 

0 

35 

0 

58 

Weight, Y 

13 


18 


18 


18 


18 


19 


21 


22 


Density, X 

0 

03 

0 

69 

0 

38 

0 

54 

1 

.00 

0 

73 

0 

77 

0 

82 

Weight, Y 

22 


24 


25 


25 


25 


27 


27 


27 


Density, X 

1 

29 

0 

.70 

0 

38 

0 

54 

1 

.08 

0 

86 

0 

40 

1 

67 

Weight, Y 

28 


29 


30 


30 


35 


37 


39 


42 



Calculate £ = 0.6225 units, y * 25.79 grams, Sx* = 3.3276, Sy* ■■ 1,211.96, Sxy 
= 40.022. 
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dated with the deviations from the regression line due, according to our 
hypothesis, to sampling variation of injury to the apple crop. The selec- 
tion of that line for which the sum of such deviations is zero, and the sum 
of their squares a minimum, is an effect of the logical extension of those 
ideas that led to the selection of the mean as an appropriate average of 
concentration. The principle of least squares is simply a statement of 
the unity in our ideas of averages for single variates and regressions for 
two or more. 

EXAMPLE 6.39 — In example 6.38 test the hypothesis, /3 «* 0. Ans. t =* 3.81, 
P < 0.01. 

EXAMPLE 6 40- — In the frog data divide SY 2 into its 3 components, thus: 


Source of Variation 

Degrees of 
Freedom 

Sum oi 
Squares 

Variance 

Mean 

Regression 

Deviations 

1 

1 

22 

15,965 04 
481.36 

730 60 

33 21 

Total 

i; 24 

17,177 00 



EXAMPLE 6 41 — How nearly free from error is the measurement of melanin 
density, X ? After preparation of a solution from the skin of the frogs, the intensity of 
the color was evaluated in a colorimeter and the readings then transferred graphically 
into neutral densities. The figures reported are means of fiom 3 to 6 determinations 
The error of this kind of measurement is usually appreciable, so that the estimate of 
regression may be biased downwards. Had not the investigator wished to learn about 
extremes of density, the regression of density on weight might have been not only 
unbiased but more informative. 

EXAMPLE 6.42 — Prove that the sums of the cross products in .ST 2 are all zero. 
The only one that offers any difficulty is S(jd v . x : set y = bx and d y . K — y — fj — y — bx. 
Multiply and add, noting that Sxy = bSx 1 . 


6.13 — Galton’s use of the term “regression.” In his studies of in- 
heritance Galton developed the idea of regression. Of the “law of uni- 
versal regression” (8) he said, “Each peculiarity in a man is shared by 
his kinsman, but on the average in a less degree.” His friend, Karl Pearson 
(12), collected more than a thousand records of stature, cubit and span 
in family groups. Figure 6.7 shows his regression of son’s stature on father’s 
Though tall fathers do tend to have tall sons, yet the average height of 
sons of a group of tall fathers is less than their father’s height. There is a 
regression , or going back, of sons’ heights toward the average height of all 
men. 

As part of Galton’s evidence he reported the results of an experiment 
on the inheritance of size in sweet pea seeds. For planting he selected 
100 seeds in each of seven size groups, then ascertained the average size 
of the offspring. He gives the following data on diameter of parent sweet 
peas and mean diameter of filial seeds: 21, 17.5; 20, 17.3; 19, 16.0; 
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Fro. 6.7 — Regression of son’s stature on father’s (12). P = 0.516X -f~ 33.73. 1078 
families. 

18, 16.3; 17, 15.6; 16, 16.0; 15, 15.3; all measurements in hundredths of 
an inch. 

The regression coefficient of filial size on parent size, that is Y on T, 
was 0.3. Gal ton thought that regression was generally about one-third, 
the average of the offspring deviating from the racial mean only about a 
third as much as the deviation of the same measurement in the lot of 
parents having a common measure. 

If you are a student of genetics you will realize that this is not a simple 
phenomenon, involving as it does such concepts as heterozygosity, domi- 
nance, etc. An additional complexity in stature is the tendency toward 
the selective mating of men and women of similar stature. Apparently this 
law of universal regression summarizes those tendencies in a species to 
preserve its unique characteristics. If tall fathers begot taller daughters 
and tall daughters bore taller sons there would be no regression, and the 
two related concepts, species and average , would presumably not exist. The 
average of concentration in statistics is that measurement of a character 
toward which regression is directed, while the average of variation stays 
constant from generation to generation because of this regression. 


EXAMPLE 6.43 —Warren (16) investigated the magnitude of regression in par- 
thenogenetic generations of aphids. He measured the frontal breadth in units of 
0.0221 mm., the results for parents (A') and offspring ( Y ) being as follows: 14 8, 13.7; 

18.4, 14.6; 17.5, 14.5; 16.3, 14.3; 16.9, 14.7, 15.8, 14.2; 15 4, 14.8; 15.2, 15.0; 15.9, 

15.8; 16.1, 15 7; 16.3, 15.1; 16.7, 15.6; 17.0, 15 5; 17.8, 15.9; 18.1, 15.4; 18.5, 15.8; 

18.9, 15.2; 15.8, 16.2; 16.0, 16.0; 16.2, 16.7; 16.6, 16.6; 16.8, 16.9; 17.5, 16.4; 18.2, 

16.3; 18.8, 16.7; 19 0, 16.9; 15.7, 17.2; 16.1, 17.0; 16.5, 17.6; 16.8, 17.4; 17.1, 17.8; 

17.7, 17.5; 18.0, 17.7; 18.8, 17.9; 19.5, 17.6; 16.0, 18.1; 17.5, 18.8; 19.0, 18.4, 18.9, 

19.5. Compute the regression coefficient, 0.4469. 
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6.14 — Regression and rate (or ratio). In section 6.2 you learned 
that the growth rates, F / X , could be appropriately averaged if two condi- 
tions were met: (i) the regression passes through the origin, and (ii) the 
weekly standard deviation increases in proportion to age. An example of 
this kind was encountered in studying the percentage of farm land in corn 
among eastern South Dakota farms (1942). For illustration we selected 
5 common farm sizes and chose at random 20 farms from each (table 6.12). 

TABLE 6.12 

Data From Eastern South Dakota Farms, 1942 
Twenty farms chosen at random from each selected size group. 


Mean 


Size of Farm 
(acres) 

X 

Average 
Number 
Acres in Corn 

Y 

Standard 

Deviation 

(acres) 

s v 

Ratio 

sJX 

Ratio 

Y/X 

80 

21 30 

8 50 

0 106 

0 2662 

160 

42 00 

15 98 

0 100 

0 2625 

240 

66 25 

20.38 

0 085 

0 2760 

320 

64 60 

28 51 

0 089 

0 2019 

400 

88 90 

39 90 

0 100 

0 2222 

240 

56 61 



0.2458 


The ratio, s y /X, is so nearly constant that the population standard devia- 
tion may be assumed proportional to acreage. If the further reasonable 
assumption is made, that the population regression contains the origin, 
then the least squares estimate of the change in corn acres per acre increase 
in farm size is the mean of the ratios, Y/X; that is, 0.2458 or 24.58%. The 
regression equation, 

? = 0.2458X, 

leads to the estimate of corn acreage in the average sized farm (240 acres), 
? = (0.2458) (240) = 58.99 acres 

If you plot corn acres, F, against farm acres, X , then plot the regres- 
sion, you will see the effect of the small weights assigned to the larger 
farm sizes on account of their larger standard deviations (lesser reliability). 
The regression line hugs the dots for the smaller farms. Another point of 
interest is this: the average number of corn acres calculated in column 2 
of the table is not the correct least squares estimate, 58.99 corn acres for a 
240-acre farm, but is biased downwards. In other words, the point A 
( x , y) is not on this regression line fitted under the two conditions specified 
above. 

In view of the extensive use of rates and ratios, it may be well to exam- 
ine a situation in which the practice is questionable. Employing appro- 
priate regression methods, Crampton (3) reported the gains in weight (F) 
of pigs eating recorded amounts of feed (X). For his lot I (table 6.13), 
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TABLE 6.13 

Data on Lot I of Crampton’s Pigs 


Feed Eaten 
(pounds) 

X 

Gain 

(pounds) 

Y 

Ratio 

Y/X 

Predicted Gain 

? - 0.1 999 X 

t = 0.2627A' - 22.1 

382 

66 

0 1728 

78 3 

76.4 

335 

72 

0 2149 

65.9 

67.0 

388 

84 

0 2165 

79 8 

77.6 

316 

47 

0.1487 

60 9 

63.2 

319 

75 

0 2351 

61 7 

63.8 

399 

87 

0.2180 

82.7 

79.8 

358 

75 

0 2095 

72.0 

71 6 

355 

73 

0.2056 

71.2 

71.0 

344 

59 

0.1715 

68 3 

68.8 

339 

70 

0.2065 

67 0 

67.8 

Mean 353 5 

70 8 

0 1999 

70.8 

70.7 


n = 10, x = 353.5 pounds of feed, y = 70.8 pounds gain, Sx 2 — 7,334.5, 
Sy 2 = 1,207.5, = 1,927.0. Figure 6.8 displays the scatter diagram 



with his regression, ? = 0.2627A' — 22.1. Crampton called attention to 
the fact that this regression does not contain the origin because feed is 
required to maintain the animal even with no increase in weight. Further- 
more, the sample evidence admits the hypothesis of constant variance in Y 
over the range found, and there is no appreciable error in the measure- 
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ment of X. It would seem correct, therefore, to employ the common 
method used throughout this chapter beginning with section 6.5. 

In this rather common situation, if the ratios, Y/X (feed required per 
pound gain), were used, they would lead to inefficient estimates. The 
regression coefficient in column 3 of table 6.13, 0.1999 pound gain per 
pound of feed (or 19.99 pounds gain per 100 pounds of feed) under- 
estimates the gains of pigs eating heavily and overestimates the others: 
the regression line through the origin docs not depict the behavior of the 
animals. The variance from regression through the origin is 91.28 
instead of the efficient estimate, 87.66, indicating a loss of 4% from the 
information available in the data. 

In the regression, Y = 0.2627 X — 22.1, the ratio of the deviations, 
y/x = 0.2627, is constant; but the ratio, Y/X, increases with A". This is 
clear from dividing both members of the regression equation by X: 


X 


= 0.2627 - 


22.1 

X 


For a pig eating 300 pounds of feed, Y/X = 0.2627 — 22.1/300 = 0 1890 
pound gain per pound of feed, while for one eating 400 pounds, Y/X 
= 0.2075. This means that the rates, Y/X, for two animals are not di- 
rectly comparable unless A r is the same in both. Comparison could be 
made by calculating the expected rates, Y/X , then contrasting the devia- 
tions of actual rates from expected. The better way is by use of adjusted 
Y or else deviations from regression (section 6.7). 

Unless there is reason to believe that the regression passes through the 
origin, the ratio, Y/X, is of dubious value. It may do for rough work, but 
careful experimentation deserves the more efficient statistical method. 


6.15 — Summary. If, in sampling from a bivariate population, Y is 
taken at random for either random or selected values of A' measured 
without error, the regression coefficient is an unbiased estimate of /?, the 
change in Y per unit change in X. If X is subject to error, b is biased, but 
not seriously unless the error in X is large as compared to the range. 
Estimates of 0 are given for homogeneous variance in Y, and for these 
additional cases if the regression contains the origin: (i) standard devia- 
tion increases with X, and (ii) variance increases with X. 

The /-test of null hypotheses about 0 is explained. Fiducial limits are 
calculated for y, b , Y, and a random Y for any X. Methods are described 
for case studies and estimates of differences between individuals and group 
means after adjustment to a common X. 

The role of regression and statistical control in experimental design 
is discussed. SY 2 together with the associated n degrees of freedom is par- 
titioned into ( SYY/n (1 degree of freedom), (£*)>) 2 /Sx 2 (1 degree of free- 
dom) and Sd v . x 2 (n — 2 degrees of freedom). 

Galton’s use of the term, regression, is illustrated. There is a discussion 
of the inefficient use of the rate (or ratio), Y/X. 
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With the six quantities, n, #, Sx®, Sy i 2 , and *Sxy, at hand, the following 
are the principal formulas used: 

1.4 = Sxy/ Sx 2 , the regression coefficient, F on X. 

2. Y = 4X or Y = <3 + 4-F, the regression equation, F on X. 

3. Y — bx, the adjusted value of F. 

4. F = y + # + d v . x , the mean of^ plus the deviation of Y from its 
mean, y , plus the deviation from regression, F — Y. 

5. »VF 2 = Sy 2 + Sy 2 + Sd v . x 2 , the partition of £F 2 into three parts. 

6. (Sxy)*/S# — Sy 2 , the sum of squares attributable to regression. 

7. Sy 2 — (Sxy) 2 / Sx 2 = Sd v . x 2 , the sum of squares of deviations from 
regression. 

8. s v . x 2 = Sd v . x 2 /(n — 2), the variance from regression or variance of 
estimate. 

0. the standard deviation from regression or standard error of 
estimate. 

10. Sy = s y . x /\/n for a fixed set of A r ’s. 

11 s b — s v x /\s'Sx 2 , the standard error of b. 

12. s v . x y/\/n + x 2 /Sx\ the standard error of Y. 

13. Sy x v / l + 1 /n + x 2 /Sx 2 , the standard error of an estimated F. 
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Chapter 7 

CORRELATION 


7.1 — Introduction. Bivariate populations are often interesting, not 
because one quantity is to be predicted from the other, but because of 
the mutual relationship between the measurements. Some evaluation of 
this relation is needed even if there is no reason for thinking of one variate 
as dependent on another. Galton’s proposal of this problem sprang from 
his study of inherited characteristics, a study attaining scientific rank 
during the latter half of the nineteenth century. Biologists and mathe- 
maticians vied with each other in developing the theory and its applica- 
tions. It turned out that the subject of correlation , as it came to be called, 
acquired a group of the concepts of regression, so that in many writings 
the two are scarcely distinguished. 

Correlations among stature, span and cubit (length of forearm) were 
reported by Pearson and Lee (18) who collected data from more than a 
thousand British families. The measurements of brother and sister heights 
in table 7.1 are selected to illustrate one of their tables. Since there is 


TABLE 7.1 

Stature (Inches) of Brother and Sister 
Illustration taken from Pearson and Lee’s sample of 1,401 families. 


Family number 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

Brother, X\ 

71 

68 

66 

67 

70 

71 

70 

73 

72 

65 

66 

Sister, X% 

69 

64 

65 

63 

65 

62 

65 

64 

66 

59 

62 

n * 11 £ x ** 69, 

ft = 

64, 

Sx t * = 

= 74, 

£*2 2 

— 66, 

Sx 1*2 

sss 

39 




r = Sxyx%! \J JSx?) (Sx-i 1 ) 39/ \/ (74)766) ® 0 558. Pearson and Lee’s r * 0.553 


here no distinction between the dependence of the two variates, they are 
designated merely as X\ and X 2 instead of X and Y. You will have no 
difficulty in verifying the calculation of the sums of squares and products. 
The new feature is the correlation coefficient , universally symbolized by r. It 
is a numeralization of that commonly observed similarity among children 
of the same parents, the tendency of the taller sisters to have the taller 
brothers. This is represented graphically by a diagram such as figure 7.1 . 
The value, r = 0.558, reflects the propensity of the dots to lie in a band 

[ 138 ] 
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Fig. 7.1 — Scatter (or dot) diagram of stature of 11 brother-sister pairs, r *=0.558 


extending from lower left to upper right, not scattered randomly over the 
whole field. These points arc confined to an elliptical area with the major 
axis inclined toward the right. Biologically, r = 0.558 measures the 
common inheritance of stature by these siblings. 

EXAMPLE 7.1 — Calculate r = 1 for the following pairs: 

X\: 1, 2, 3, 4, 5 
X 2 : 3, 5, 7, 9, 11 

Represent the data in a graph similar to figure 7.1. 

EXAMPLE 7.2 — Verify r = 0.91 in the pairs: 

Xxz 2, 5, 6, 8, 10, 12, 14, 15, 18, 20 
X 2 : 1, 2, 2, 3, 2, 4, 3, 4, 4, 5 

Plot the elliptical band of points. 

EXAMPLE 7.3— In the following, show that r = 0.20: 

Xu 3, 5, 8, 11, 12, 12, 17 
X t : 11, 5, 6, 8, 7, 18, 9 

Observe the scatter of the points in a diagram. 

EXAMPLE 7.4— In the apple data of table 6.5, Sx* = 924, Sy* - 1,222, Sxy 
—936. Calculate r = —0.88. 


7.2 — The correlation coefficient, r. The expression, Sx 1 * 2 /%/ (Sx 1 2 ) ( Sx a 2 ), 
is designed to vary between — 1 and + 1 according to the closeness of the 
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relationship in the population sampled. Negative values of r, like those 
of by indicate an inclination of the ellipse of points downward toward the 
right, large values of one variate being associated with small values of the 
other. To help you acquire some experience of the nature of r, a number 
of simple tables with the corresponding graphs are displayed in figure 7.2. 
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In A, the nine points lie on a straight line, the condition for r = 1. 
The line is a “degenerate” ellipse — it has length but no width. In this 
relation between the two variates they keep in perfect step, any change in 
one being accompanied by a proportionate change in the other. B depicts 
some deviation from linearity, the ellipse being long and thin with r 
slightly reduced below 1. In C, the ellipse widens, then reaches circu- 
larity in D where r = 0. This denotes no relation between the two variates. 
E and D show negative correlations tending towards —1. Summarizing: 
the thinness of the ellipse of points exhibits the magnitude of r, while 
the inclination of the axis to the right or left shows its sign. It should be 
observed that the slope of the axis is determined by the scales of measure- 
ment adopted for the two axes of the graph and is therefore not a reliable 
indicator of the magnitude of r. It is the concentration of the points near 
the axis of the ellipse that signifies high correlation, especially if the regres- 
sion is large. 

I think you will agree that the larger correlations, either positive or 
negative, are fairly obvious from the graphs. It is not so easy to make a 
visual evaluation if the absolute value of r is less than 0.5; even the direc- 
tion of inclination of the ellipse may elude you if r is between —0.3 and 
+ 0.3. In these small samples a single dot can make a lot of difference. 
In Dy for example, if the point (26, 0) were changed to (26, 9) the corre- 
lation would be increased to 0.505. This emphasizes the fact that sample 
r’s from a bivariate population in which the correlation is p (the Greek 
rho distinguishes the parameter from the statistic) are quite variable if n 
is small. The sampling distribution of r will be considered in section 7.4. 
In assessing the value of r in a table, select some extreme values of one 
variate and note whether they are associated with extreme values of the 
other. If no such tendency can be detected, r is likely small. 

Perfect correlation (r = 1) rarely occurs in biological data, though 
values as high as 0.99 are not unheard of. Each field of investigation has 
its own range of coefficients. Characteristics inherited like height ordi- 
narily have correlations between 0.35 and 0.55. Among high school 
grades r averages around 0.35 (3). Pearson and Lee got “organic correla- 
tions,” that is, correlations between two such measurements as stature 
and span in the same person , ranging from 0.60 to 0 83. Brandt (2) calcu- 
lated the correlation, 0.986, between live weight and warm dressed 
weight of 533 swine. Evvard, et al . (6) estimated r = —0.68 between 
average daily gain of swine and feed required per pound gained. 

It is clear that judgment about the size of a correlation should be made 
in the light of similar correlations encountered in the same field, some- 
times with little reference to the theoretical limits, =*= 1 . 

7.3 — Calculation of r with coding. If the data have been coded 
(section 5.4), the correlation coefficient may be computed directly from 
the coded sums of squares and products. Unlike the regression coefficient, 
r does not require the decoding of the calculated values. For example, 
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in table 6.8, r = 735.2/\/(674.1)(832.4) = 0.98. The same result would 
attend the use of the decoded values, r = 1 , 470. 4/\/(26. 96) (83,240). 


EXAMPLE 7.5 — Thirty students scored as follows in two mathematics achieve- 
ment tests: 


I 73 41 83 71 39 60 51 41 85 88 44 71 52 74 50 

II 29 24 34 27 24 26 35 18 33 39 27 35 25 29 13 


42 85 53 85 44 66 60 33 43 76 51 57 35 40 76 

13 40 23 40 22 25 21 26 19 29 25 19 17 17 35 


Calculate r = 0.774. 

EXAMPLE 7.6 — Grout (10) measured the wing length and tongue length, both in 
millimeters, of 44 bees: 


Wing 9 68 9 81 9 59 9 68 9 84 9 59 9.61 9 55 9.25 9 08 9 70 

Tongue 6 53 6.71 6 70 6 69 6 70 6 62 6 59 6 55 6 35 6 25 6 61 


9 60 9 50 9 74 9 72 9 64 9 73 9.77 9 72 9 54 9 65 9 74 

6 51 6 55 6 74 6 75 6 45 6 75 6 70 6 65 6 68 6 77 6 44 


9 59 9 71 9 56 9 61 9 61 9 55 9 78 9 74 9 48 9 71 9 20 

6 54 6 64 6 55 6 57 6 61 6 64 6 64 6 63 6 62 6 55 6.22 


9 53 9 74 9 67 9 56 9 49 9 64 9 45 9 52 9 58 9 60 9 68 

6 43 6 67 6 68 6 62 6 71 6 70 6 50 6 41 6 50 6 62 6 69 


Code by subtracting 9 from each wing length and 6 from each tongue length. Calculate 
n - 0.731. 

EXAMPLE 7,7 — Calculate r = 0 96 between percentage reduction in leaf area 
and reduction in yield of shelled corn, example 6.4. Code the first set by dividing each 
item by 16%. Don’t forget the pair (0, 0). Contrast the selection of percentage reduc- 
tion in leaf area with the specifications of section 7.6. 

EXAMPLE 7.8 — After reviewing examples 17, 18, and 19 of chapter 5, prove that 
if X' «= a + bX and Y' ~ c -|- dY , then r x > v > = =*= r xv • 

7.4 — Relation between the coefficients of correlation and regression. 

If A r 2 were designated as dependent, its regression on X\ would be bn 
= Sxix^/Sx-?) but if Xi were dependent, the regression of X\ on X 2 would 
become b 12 = Sx\X 2 /Sx<?. These expressions differ in their denominators 
only. The regression lines corresponding to the two circumstances are 
drawn in the several diagrams of figure 7.2. If r is near =±= 1 the lines are 
close together, becoming identical in A; if r = 0, they are perpendicular. 
Note: if X 2 is the independent variate, the slope, £ 12 , is measured from 
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the X\ axis, vertical in the figures. The regression of Xi on X* is always 
the line that makes the lesser angle with the vertical axis. 

Now, it is easily seen that 

Sx 1*2 Sx 1*2 (&* 1* 2 ) 2 

D 9 1 V 1 5 *— " . — — = f* 

Sx I 2 Sx 2 2 kSa:i 2 *5x2 2 


Thus, 

r = \/ biib 12 

In other words, r is the geometric mean of the two regression coefficients 
that might be calculated: it is intermediate in value between them. Let 
us illustrate. 

If we were interested in estimating the statures of sisters from those of 
their brothers, we should have (table 7.1), 

bn = 39/74 = 0.527 inch sister’s height per inch brother’s, 

graphed as AB in figure 7.1. On the other hand, if the interest were in 
estimating brothers’ heights, 

b 12 = 39/66 = 0.591 inch brother’s height per inch sister’s, 

the regression line being CD. Usually, however, it is the mutual relation- 
ship that attracts attention because this is taken to represent the herita- 
bility of stature, so we calculate 

r = V (0.527) (0 .591) = 0.558, 

an average of the two regression coefficients. 

This serves to clarify the relation of the two coefficients, correlation 
and regression, in measuring relationship. The latter is the appropriate 
one if one variate, F, may be designated as dependent upon the other, X. 
Fmay partly be controlled or caused by X , as when the available amounts 
of some glandular secretion cause differences in the sizes of organisms. 
Or, Y may be subsequent to X, as weight gain in nutrition experiments 
follows the measurement of initial weight. In such cases, the regression 
of Y on X is usually the statistic that furnishes the information desired. 
It is then appropriate to attempt to estimate the value of Y from a knowl- 
edge of the corresponding value of X. Correlation, on the other hand, 
is the appropriate measure of the relation between two variates like statures 
of sister and brother. The two heights are known to be associated through 
the complex mechanism of inheritance, but neither rnay be looked upon 
as a consequence of the other. In this sense correlation is a two-way aver- 
age of relationship, while regression is directional. Of course, there are 
many variates whose relation may be studied by means of either correla- 
tion or regression, or both. It is necessary only to keep clearly in mind 
the character of the relation being considered. 
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A characteristic of r is revealed by dividing both numerator and 
denominator of the ratio by n — 1 : 


r 


Sx 1 X 2 


Sx 1 X 2 

V(Sxi 2 )(Sxf) 


n - 1 



Sx£ 

72—1 


covariance 

vWKsT) 


covariance 

geometric mean of variances 


Correlation is thus seen to be the quotient of two averages of variation: 
one, the covariance of the two measurements, X\ and the other, an 
average of the two variances. It Ls an abstract number measuring covaria- 
tion: if of two related characters each occurs in various sizes, their corre- 
lation is a measure of the extent to which their variations are concomitant. 

Similarly, b takes on new meaning from the changes below: 

b ~ _ \/Sf7\n — 1) __ Sj 

Sx 2 \J Sx 2 Sy 2 \/Sx 2 x/Sx 2 /(n — 1) s x 

In this manner b Ls intimately related to r. Either can be calculated from 
the other if the standard deviations of the two variates are known. 

It is often convenient to use the standard deviation as the unit for 
measuring the deviate, x ~ X — x. The standard deviate , x' = x/s, has 
the average range given in table 5.5 for a = 1 ; that is, of course, if samples 
are randomly drawn from normal populations. As an example, suppose 
two aptitude tests have been applied, one to fourth grade and the other 
to sixth grade pupils. Designate the tests as IV and VI. Now IV may 
have x — 40 and s = 10, while VI has x = 60, s = 20. If pupil A scores 
60 in IV and his brother B is graded 80 in VI, their standard deviates 
(or standard measures), 


„ / 60-40 

A: x' = = 2, 

10 


„ , 80 - 60 4 

B: x — = 1, 

20 


show their relative standing in the tests. 

It is clear that every sample of standard deviates has x f = 0 and 
s' = 1. Take for illustration the set, 

X: 6, 6, 2, 8, 6, 8, 6 
Since the mean is 6, the deviations are 

*: 0, 0, -4|2, 0, 2, 0 
Now, s = 2, so the set of standard deviates is 
V: 0, 0, -2, 1, 0, 1, 0, 
in which x' = 0 and s' == 1 . 
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If you now write the regression of Y on X in the form (section 6.5), 
$ = bx = r . — . x , 

s x 

then decide to use standard measure for each variate, you will have the 
simple form, 

$ = rx '> 

because s v = s x = 1, and r is therefore the same as b. Thus, in standard 
measure r is the regression coefficient, the average change in y per stan- 
dard deviation change in x f . We have already noticed that r is unaffected 
by units of measurement. Now it appears that if we use standard measure, 
also independent of units of measurements, the distinction between the 
correlation and regression coefficients disappears. 

7.5 — Sampling variation of the correlation coefficient. Common 
elements. A convenient way to draw samples from a normal bivariate 
population is by use of an old device called common elements (7). You may 
go back to the random sampling scheme of section 3.3, or to the samples 
already drawn from table 3.1. In a new table, such as 7.2, record some 
convenient number, say three, of the random pig gains. These gains, or 

TABLE 7.2 

Calculation of Three Pairs of Values of the Variates X x and X% Having 

Common Elements 

The elements arc pig gains from table 3.1 



Pair 




Elements 





1 


23’ 




23 





44 


common 

— > i 

44 





43] 




43 





371 

J 


different 

/ 30 
l 33 




X] 

« 147 




173 

- Xi 


2 


40] 



| 

f 40 





16 

- « — 

common 


16 





19 




[ 19 





30] 

_J 

* - 

different 


' 29 
13 




X\ 

« 105 




117 

= X* 


3 


23] 




f 23 





38 


common 

—4 - 

38 





37 




37 





30] 

J 

► - 

different 

-1 

r 3i 
; 41 




X , 

- 128 




170 

- Xt 
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elements , are written twice in the table. Then continue the drawing, 
adding, for example, one more randomly drawn gain to the lefthand 
column, and two more to the right. The sums constitute the paired values 
of A\ and AV Three such pairs arc computed in the table. It is clear that 
there is a relation between the two sums in each pair. If the three common 
elements all happen to be large, then both A\ and X 2 are likely large, 
irrespective of the extra elements contained in each. Naturally, owing 
to the noncommon elements, the relation is not perfect. If you continue 
this process, drawing a hundred or more pairs, then compute the correla- 
tion, you will get a value of r not greatly different from the population 
value, 

P = 3M4K5) = 0.67 

The numerator of this fraction is the number of common elements, while 
the denominator is the geometric mean of the total numbers of elements 
in the two sums, X\ and X 2 . A formula may be written for this parameter. 
If niz represents the number of common elements, with ri\ and ti 2 desig- 
nating the total numbers of elements making up the two sums, then the 
correlation between these two sums is theoretically, 

P = n^/\/rhn2 

Of course, there will be sampling variation in the values calculated from 
drawings. You may be lucky enough to get a good verification with only 
10 or 20 pairs of sums. With 50 pairs we have usually got a coefficient 
within a few hundredths of the expected parameter, but once we got 0 28 
when the population value was 

ni2/Vn\n 2 = 6/V (9) (16) = 05 

If you put the same numbers of elements into A’i and A' 2 , then n\ = n%. 
Denoting this common number of total elements by w, the parameter may 
by symbolized by 

P = ri 12/ n, 

the ratio of the number of common elements to the total number in each 
sum. In this special case, the correlation coefficient is simply the fraction 
of the elements which are common. Roughly, this is the interpretation 
of the sister-brother correlation in stature (table 7.1), usually not far 
from 0.5: an average of some 50% of the genes determing height are com- 
mon to sister and brother. 

Another illustration of this special case arises from the determination 
of some physical or chemical constant by two alternative methods. As 
an example, consider the estimation of the potassium content of the ex- 
pressed sap of corn stems as made by two methods, the colorimetric and 
the gravimetric (16). Two samples are taken from the same source, one 
being treated by each of the tw r o techniques. The common elements in the 
two results must be associated with the actual weights of potassium in the 
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pair of samples. Extraneous elements may be attributed to the differ- 
ences in the accuracy with which the potassium is determined by the two 
procedures, as well as to the random variation in the two samples. The 
correlation between 24 pairs of determinations was 0.87. On the assump- 
tion that each determination is the sum of some constant number of 
elements chosen at random from a homogeneous population, it may be 
asserted that 87% of the elements in the two determinations are common. 
Thus, 87% of the elements are associated with the actual weight of potas- 
sium in the sap, the remaining 1 3% being attributed to errors of measure- 
ment and sampling variation. 

A less specific illustration is contained in the correlation, r = 0.70, 
between the weights of cob and grain in 250 ears of Country Gentleman 
sweet corn (11). If you are willing to postulate an equal number of ele- 
ments determing cob weight and grain weight, then 70% of them my be 
common. But what are they? Some of them inherited, perhaps, and some 
environmental. 

The reader w r ill have no difficulty in extending this list of illustrations. 
The disturbing necessity is to keep in mind the limitations. The elements 
in the two variates must not only be equal in number, but must be drawn 
from a normally distributed population. Further, the effects of the ele- 
ments must be describable by a sum. If one element, included by chance, 
should double the effects of other elements, then the interpretation would 
be faulty. Despite these limitations, the careful thinker can greatly 
enhance his ideas of correlation by this concept of common elements. 

Unless there is the same unit of measurement in X\ and X 2 , there is 
little hope of applying the theory of common elements. For instance, in 
Dungan’s study of hail injury to maize (example 6.4), while both the 
variates are percentages, yet one is based on areas, the other on weights. 
It would be hard to specify the nature of the common elements, though 
undoubtedly there are some. 

When you have carried through one or more of the calculations of 
r with common elements, you are well aware of the sampling variation of 
this statistic. However, it would be too tedious a task to compute enough 
coefficients to gain even a rough picture of the distribution curve. This 
must be left to the mathematicians to be done from theoretical considera- 
tions. In figure 7.3 are the curves for samples of 8 drawn from populations 
with correlations zero and 0.8. Even the former is not quite normal. 
The reason for the pronounced skewness of the latter is not hard to see. 
Since the parameter is 0.8, sample values can exceed this by no more 
than 0.2, but may be in defect of it by as much as 1 .8. Wherever there is a 
limit to the variation of a statistic at one end of the scale, with practically 
none at the other, the distribution curve is likely to be asymmetrical. 
Of course, with increasing sample size this skewness tends to disappear. 
Samples of 300 pairs, drawn from a population with a correlation even 
so great as 0.8, would have little tendency to range more than 0.05 on either 
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~i.o - o.e> -O.G ~0 4- ~O.Z O OZ 0 4 0<2> OB 1.0 

VAI-.UIIS OF r* 

Fig. 7.3 — Distribution of correlation coefficients in samples of 8 pairs drawn fiorn two 
normally distributed bivariate populations having the indicated values of p. 

side of the parameter. Consequently, the upper limit, unity, would not 
constitute a restriction, and the distribution would be almost normal. 

EXAMPLE 7.9 — Suppose all the elements of Xi are included in Xi. Then nn = n\ 
and p — \fn\]n% This is the relation that may exist when the whole, X 2 , is correlated 
with one of its parts, X\. An example is the correlation between the live weight and 
warm dressed weight of swine (1). Among the 533 animals studied, the mean live 
weight was 222.6 pounds while the mean dressed weight was 185.1 pounds. If one 
assumes that each pound of dressed weight represents a common element, then r might 
be an estimate of >/l85 1 /22l 6 = 0 91. Compare this with the sample r *■ 0 986 and 
the discussion of section 7 11. 

EXAMPLE 7.10 — In a tea plantation (5), the production of 16 plots during one 
14* week period was correlated with the production of the same plots in the following 
period of equal length. The correlation coefficient was 0.91. Can you interpret this 
in terms of common elements? 

7.6 — Estimating the correlation coefficient: fiducial statements and 
tests of hypotheses. If random samples are drawn from a normal bivariate 
population with correlation, p, the statistic, r, though not unbiased (8), 
is an appropriate estimate of p. Errors in either X or Y will further bias 
the coefficient. Unlike regression samples, neither X nor Y may be 
selected. You see, then, that in samples such as the soybean heights of 
table 6.3, the correlation coefficient provides only a doubtful estimate of p 
because X is selected. This is a marked distinction between samplings 
for estimating p and p. 
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Fisher showed also that the null hypothesis, p = 0, may be tested by 
use of / whose sample value is 

t = r\/(n — 2)/(l - r 2 ), d.j. = n — 2 
For illustration, assume that the r = 0.597 in C of figure 7.2 is randomly 
drawn under the conditions specified above. What is the probability of a 
larger r in sampling from p = 0? We have 

t = 0.597\/(9 - — 2)/(l - 0.5 97 2 7 = 1.969, dj . = 7 
Interpolation in the /-table elicits the approximate probability, 0.09, of 
a larger value of /. Hence, a greater value of r could happen about 9 
times per 100 in sampling from p = 0, and the null hypothesis would not 
likely be rejected. This throws light on the difficulty (section 7.2) of 
graphical evaluation of small correlations — they may be no more than 
accidents of sampling. 

The test of this null hypothesis, p = 0, is made at sight in table 7.3. 


TABLE 7.3 

CORRFI ATION COEFFICIENTS AT THE 5% AND 1 % Lf.VFI S OF SIGNIFICANCE 


Degrees of 
Freedom 

5% 

i% 

Degrees of 
Freedom 

5% 

i% 

1 

997 

1 000 

24 

.388 

496 

2 

950 

.990 

25 

.381 

.487 

3 

878 

959 

26 

374 

478 

4 

811 

917 

27 

367 

470 

5 

.754 

874 

28 

361 

463 

6 

707 

834 

29 

.355 

456 

7 

666 

798 

30 

349 

449 

8 

632 

765 

35 

325 

.418 

9 

602 

735 

40 

304 

393 

to 

576 

.708 

45 

288 

.372 

11 

553 

684 

50 

273 

.354 

12 

532 

661 

60 

250 

.325 

13 

514 

.641 

70 

.232 

.302 

14 

497 

623 

80 

.217 

.283 

15 

482 

606 

90 

.205 

267 

16 

468 

590 

100 

195 

.254 

17 

456 

575 

125 

.174 

228 

18 

444 

561 

150 

.159 

208 

19 

433 

549 

200 

.138 

181 

20 

423 

.537 

300 

.113 

.148 

21 

.413 

526 

400 

098 

.128 

22 

404 

515 

500 

088 

115 

23 

396 

505 

1,000 

062 

081 


Portions of this table were taken from Table VA in “Statistical Methods for Re- 
search Workers” by permission of Professor R. A. Fisher and his publishers, Oliver 
and Boyd. 


Simply look along the row for dj. — 7 and observe the position of the 
sample r relative to the tabular values. Our r = 0.597 is considerably 
less than the 5% level, 0.666, leading to the same conclusion as before. 
The test is made without considering the sign of r. Among the following 
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, observe particularly how conclusions 

are affected by both 

sample size 

and the size of r: 



Number of Decrees of 


Conclusion About 

Pairs 

Freedom 

r 

Hypothesis, p *= 0 

20 

18 

0 60 

Reject at 1 % level 

100 

98 

0 21 

Reject at 5% level 

10 

8 

0 60 

Not rejected 

15 

13 

-0 50 

Not rejected 

500 

498 

-0 15 

Reject at 1 % level 


Those who wish a sketch oj the main features of statistics at first reading n. ay well omit the 
remainder of this chapter , together with all of the next . Go to chapter 9 if you wish to learn more of 
enumeration statistics , or to chapter 10 for the elements of analysis of variant e . 


In figure 7.3 was shown the asymmetry of distributions of small 
sample r’s from populations with large values of p. Only under the hy- 
pothesis, p = 0, can the /-distribution be used precisely for testing sig- 
nificance. It is unsuited for testing other null hypotheses, such as p — 0.5 
for example, or pi — p 2 = 0. Equally, t cannot be used for making 
fiducial statements about small sample correlations. A convenient and 
sufficiently accurate solution of these problems was provided by Fisher (9) 
who devised a transformation from r to a quantity, z, distributed almost 
normally with variance, 



“practically independent of the value of the correlation in the population 
from which the sample is drawn.” The relation of z to r is given by 

Z — H[log,(l + r) - Iog«(l - r)], 

but figure 7.4 enables us to change from one to the other with sufficient 
accuracy for ordinary sampling. Following are some examples of the use 
of z- 

1 . It is required to set fiducial limits to the value of p in the population from 
which a sample r has been drawn . As an example, consider r = —0.889, 
based on 9 pairs of observations, figure 7.2F. From figure 7.4, z ~ 1.417 
corresponds to r = 0.889. Since n — 9, s z = \/l/(9 — 3) = 0.408. Now, 
Z is distributed almost normally, independent of sample size, so t oi 
= 2.576, corresponding to df. — oo . This value may be taken from table 
3.8, or approximately from the table of probability in the normal distri- 
bution, table 8.6. The 99% fiducial limits, then, are 
h = 1.417 - (2.576) (0.408) = 0.366, and h = 1.417 + 1.051 = 2.468 
Returning to the figure, and restoring the sign, the corresponding limits 
to p are 


h = -0.350, h = -0.986 
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Emphasis falls on two facts: (i) in small samples the estimate, r, is not 
very reliable; and (ii) the limits are not equally spaced on either side of r, 
a consequence of its skewed distribution. 

2. Occasionally there is reason to test the hypothesis that p hat, some particular 
value , other than zero , in the sampled population (p — 0, you recall, is tested 
by use of table 7.3). An example was given in section 7.5, where r = 0.28 
was observed in a sample of 50 pairs from p = 0.5. What is the prob- 
ability of a larger deviation? For r = 0.28 take from the figure z = 0.288, 
and for p ~ .5, z = 0.549. The difference, 0.549 — 0.288 = 0.261, has 
the standard error, 1 /y/n — 3 = 1 / \/ 47 = 0.1459. Hence, t = 0.261/ 
0.1459 = 1.80, with df. = oo, does not reach the 5% level: the sample 
is not as unusual as 1 in 20. 

3. To test the hypothesis that two sample values of r are drawn at random from 
the same population , convert each to z, then test the significance of the 
difference between the two z’s. For two lots of pigs the correlations be- 
tween gain in weight and amount of feed eaten are recorded in table 7.4. 

TABLE 7.4 

Test of Significance of the Difference Between Two Correlations of Gain 
With Feed Eaten Among Swine 


Lot Pigs in Lot r z 

1 5 0 870 1 333 

2 12 0 560 0 633 

Difference — 0 700 

si - \W6\T = 0.782. / - 0 700/0.782 - 0.895. D f 


Sum 

as 0O . 


1 /(« - 3) 

0 500 
0 111 


- 0 611 


P » 0.37 


The difference between the ^-values, 0.700, has the variance, 


n\ — 3 


+ 


n 2 


i + 1-0.0,, 


The test is completed in the usual manner, calculating t as the ratio of 
the difference of the -t’s to the standard error of this difference. With 
P = 0.37 there is clearly no reason to reject the hypothesis that the £’s 
are from the same population, and hence that the r’s are from a common 
population correlation. 

4. To test the hypothesis that several r's are from the same p , and to combine 
them into an estimate of p. There is often occasion to think that several 
sample correlations are drawn from a common p. If this null hypothesis 
is not rejected, then it is appropriate to combine the r’s into an estimate 
of p more reliable than that afforded by any of the separate r’s. Lush (15) 
was interested in an average of the correlations between initial weight 
and gain in 6 lots of steers. The computations are shown in table 7.5. 






Fig. 7.4 — Corresponding values of r and t* 
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TABLE 7.5 

Test of Hypothesis of Common p and Estimation of p . Correlations Between 
Initial Weight and Gain of Steers 


Samples 

! No. 

*= n 

Df. - 
n — 3 

r 

i 

z 

Weighted z 
« (n - 3k 

Weighted 

1 Square 
■= (n - 3k 2 

Cor- 

rected 

1 

1927 Herefords 

4 1 

1 

0 929 

1.651 

1 651 

2 726 

1.589 

1927 Brahmans 

13 

10 

0 570 

0 648 

6.480 

4.199 

0 633 

1927 Backcrosses 

9 

6 

0 455 

0 491 

2 946 

1 446 

0 468 

1928 Herefords 

6 

3 

-0 092 

-0 092 

-0 276 

0 025 

-0 055 

1928 Brahmans 

11 

8 

0.123 

0 124 

0 992 

0.123 

0 106 

1928 Backcrosses 

14 

11 

0.323 

0 335 

3 685 

1 234 

0 321 

57 

39 



15 478 

9 753 

14 941 


Average £ = 0.397 

6 145 

C = 0 383 


Average r 

* 0 377 

X 2 = 3 608 

r = 0 365 


Each z is weighted (multiplied) by the reciprocal of its variance, so that 
small samples have little weight. The sum of the weighted ^’s, 15.478, 
is divided by the sum of the weights, 39, to get the average z = 0.397. 
The next column contains the calculations that lead to the test of the 
hypothesis that the six sample correlations are drawn from a common 
population correlation. From the sum of the quantities, (n — 3 )z 2 , 
obtained by multiplying each z by the corresponding (n — 3 )c, is sub- 
tracted a correction, 

(S(n ~ 3) z ) (average z) = (15 478)(0.397) = 6.145, 
the result being a value of chi-square. This value, 3 608, is different 
from those tabulated in chapter 1 in that it is based on 5 degrees of 
freedom (less by one than the number of lots) instead of 1 . If you look 
forward to table 9.1 you will find this new distribution in the fifth line, 
with our present value lying between 3.000 (P — 0.70) and 4.351 (P 
= 0.50). Certainly there is no reason to suspect that the 6 correlations 
may not have been drawn from the same population. This being true, it is 
appropriate to ask their average value. This is got by taking from figure 
7.4 the correlation 0.377 corresponding to the average z — 0 397 . Don’t 
fail to note the great variation in these small sample correlations. 

Fisher pointed out that there is a small bias in £, each being too large by 

P 

2{n - 1) 

You see that the bias usually may be neglected in all the foregoing calcu- 
lations. It might be serious, however, if large numbers of correlations 
were averaged, because the bias accumulates, one bit being added with 
every z • It would be easy to subtract the bias from each z were the popu- 
lation p not involved. If there is need to increase accuracy in the calcula- 
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tion of table 7.5, the average r = 0.377, may be substituted for p; then 
the approximate bias for each z may be deducted, and the calculation of 
the average z repeated. Since this will decrease the estimated r, it is 
well to take p slightly less than the first average. For instance, it may be 
guessed that p = 0.37, then the correction in the first z is 0.37/2(4 — 1) 
— 0.062, and corrected z is 1.651 — 0.062 = 1.589. The other corrected 
c’s are in the last column of the table. The sum of the products, 

S(n — 3) (corrected z) = 14.941, 

appearing in the calculating machine with no intermediate recording, is 
divided by 39 to get the corrected mean value of z, 0.383. The correspond- 
ing correlation is 0.365. Before straining to reach precision remember 
two facts: the true p is known only approximately, and the values of 
r and z are read from a graph inaccurate in the third decimal place. For 
tables of the distribution of r, sec reference (4). 

EXAMPLE 7.11 — To get an idea of how the selection of pairs affects correlation, 
try picking the 5 lowest values of test II (example 7.5) together with the 6 highest. 
The correlation between these 1 1 scores and the corresponding scores on test I turns 
out to be 0 89, indicating the unreality that can be introduced into r by non-random 
choice. 

EXAMPLE 7 12 — Set 95% fiducial limits to the correlation, 0.986, n ** 533, 
between live and dressed weights of swine. Ans. 0.983 — 0.988. 

What would have been the fiducial limits if the number of swine had been 25? 
Ans. 0.968 - 0.994. 

EXAMPLE 7.13 — The correlation, 0.731, between wing and tongue length in 
bees (example 7 6) was one among 4 similar correlations calculated by Grout, the 
other three being 0 354, 0.690, and 0.740, each based on a sample of 44. Test the hy- 
pothesis that these are samples from a common p. Ans. x 2 = 9 164, dj. — 3, P — 0.03. 
In only about 3 trials per 100 would you expect such disagreement among 4 correla- 
tions drawn from a common population One would like to know more about the dis- 
cordant correlation, 0.354, before drawing conclusions. 

EXAMPLE 7.14 — Estimate p in the population from which the 3 bee correlations, 
0.73L 0.690, and 0 740, were drawn Ans. 0.721. 

EXAMPLE 7.15 — Set 99% fiducial limits on the foregoing bee correlation. Note: 
r *= 0.721 is based on 3 X 41 =123 dj. It is therefore equivalent to a single r from a 
sample of 123 + 3 = 126 bees. The fiducial limits: 0.590 — 0.815. 

EXAMPLE 7 16 — Although there is evidence against the hypothesis that the 4 
bee correlations were drawn from a common p, it would not be appropriate ordinarily 
to ask if the small one differs significantly from the average of the 3 larger. For sake of 
illustration, however, let us test the hypothesis that r = 0.354, which might have been 
derived under one set of circumstances, is randomly drawn from the same p as the 
average of the other 3, which could have come from another hive, say. You have 
found that these 3 larger correlations average 0 721. Show that this differs significantly 
from the small correlation. Ans. x 2 — 8.97, dj. = 1, P < 0.01. 

7.7 — Correlation and regression. Every pair of columns in the apple 
data of table 6.5 will yield a correlation coefficient, some of them of 
notable interest. Consider the series of equal correlations, 

Txr = r xv = Tyy = Ty $ = T yi 

Each of the corresponding pairs of columns consists of coded values of 
another pair so that these correlations are all the same (section 7.3). 
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As examples: (i) The deviations, x and y, are coded values of X and F 
got by subtracting x from each X and y from each Y ; thus, r xy = r XY . 
(ii) Since ¥ = a + bX then ¥ is a coded X, each X being multiplied by b 
and the product increased by a ; hence, the correlation between X and Y 
is equal to that between ¥ and Y. 

This correlation between Y and ¥ is especially enlightening. Restating 
it, we may say that the correlation between Y and its regression estimate 
is identical with that between F and X. This means that if F is uniformly 
near the regression line then r Y y and its equal, r XY , are close to =*= 1 . Thus, 
the value of r XY is, in this specific sense, a measure of success in estimating 
Y by means of regression. 

Another correlation worth noticing is the one that you may compute 
from the columns, A' and Y. Since ¥ is the coded value of A", this correla- 
tion between X and its own coded value is =*= 1 w r ith sign the same as that 
of b. Rounding errors may prevent you from verifying this exactly. The 
meaning of this correlation is that ¥ changes in steps proportional to 
changes in X , the successive points, (A r , T), all lying exactly on the 
regression line. 

The correlations of d v . x with x and y may also be calculated and inter- 
preted. The first, r xA ^ * = 0, recalls one of the assumptions made in 

fitting this regression: the standard deviation of d v . x is, in the population, 
uniform throughout the range, and therefore is uncorrelated with *. The 
second, fyd = y/l — r Ty 2 , may be visualized in this manner: if the re- 
gression is zero, the regression line is horizontal, and d y . x coincides with y; 
that is, fyd is unity when r xv = 0. At the opposite extreme, as the 

points cluster more and more closely around the line, r xy approaches 1; 
and since all the deviations, d y . X) approach zero, there is nothing forj> to 
keep step with and r vd approaches zero. 

It will be observed that in this section r has been treated merely as a 
calculated quantity without reference to estimation or probability state- 
ments. The formulas giving the relation of correlation to regression would 
be just as true of the soybean data (table 6.3) as of the apple data despite 
the fact that, on account of the selection of X , the correlation between 
time and height may not be an informative estimate of the population 
parameter. This distinction between r as an estimate of p and r as a 
convenient computational device must be observed carefully. Not only 
in the present section but also in the three to follow, r is used merely as a 
fraction related to a group of regression concepts. Whether or not it be 
considered an estimate of p depends on whether the sampling conditions 
of section 7.6 have been met. 

EXAMPLE 7.17 — Prove that the correlation of d v , x with x is zero Hint, using 
deviations, d v . x — y — bx. Then the sum of products, Sxd u . x = Sxy — bSx 2 Since 
b « Sxy/Sx 2 , this reduces to 0. 

EXAMPLE 7.18 — Prove that the correlation of d v . x with y is V'l — r x f. 
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7.8 — Correlation and the partition of Sy 2 . The sum of squares 
attributable to regression (section 6.12) is readily changed as follows: 




(%)» 
s*sy ' 


sy = r 2 sy 


That is, r* is the fraction of Sy~ due to regression. The remainder is 


sd y j — sy — r\sy = (i - T')sy 


You see how intimate is the relation between r and the partitioning of 
Sy 2 in regression, r 2 is the fraction of Sy 2 associated with the correlated 
changes in Y and X while (1 — r 2 ) is the remaining fraction quite inde- 
pendent of X. 

If you can bring together these ideas and consider them in relation to 
figure 6.3 and table 6.5, it is illuminating to study the manner in which 
this entire complex is affected by different values of r. Take, for example, 
the extreme case in which r — 0, and therefore 1 — r 2 = 1. None of the 
variation in percentage wormy apples would then be accounted for by 
differences in crop size. The regression line would be horizontal, cutting 
the vertical scale at the point y = 45%. The regression percentages in the 
table would be all the same and equal to y. The deviations from regression 
would be simply deviations from mean value. The computation of s v . x 
would be identical with that of s V) except for degrees of freedom. 

The opposite extreme is that in which r = =*= 1 and 1 — r 2 = 0. Now all 
the variation in wormy apples would be attributable to regression, every 
dot in the figure would lie on the regression line, the regression per- 
centages would be identical with the corresponding observed values, and 
the deviations from regression together with the standard deviation from 
regression would all be zero. 


7.9 — Correlation, common elements, and regression. Common ele- 
ments help to clarify some of the facts of regression. To understand this 
one must make a slight alteration in the ideas presented in section 7.5. 
In table 7.2, think of four elements being drawn, their sum constituting 
AY Transfer three of them, taken at random, to X 2 ; then add two more. 
The resulting value of X 2 is controlled, to some extent, by the elements 
transferred from AY In this sense, X\ is one of the causes of AY While 
the computation of r is unaffected by this new attitude, the emphasis 
has been shifted to regression. It has been shown (7) that, using x\ and x 2 
as deviations from mean, the regression of x 2 on x\ is 

x 2 = (n n /ni)x x 

the regression coefficient being the ratio of the number of transferred 
elements to the number in *i*. As an example, consider the inheritance 

* It is convenient to think of x\ and x 2 as variates instead of X\ and AT These new 
variates are the sums you would have got in table 7.2 if you had subti acted £ = 30 
pounds from every pig gain in table 3.1. 
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of stature. An average of half the genes controlling height are transferred 
to the son by his father. If this were undisturbed by complicating cir- 
cumstances, one would expect the regression of son’s height on father’s 
to be X 2 = 0.5xi, the expected deviations of sons’ heights from their mean 
averaging just half the deviations of fathers’ heights from theirs. If you 
are a student of genetics, you will know that this coefficient is often less. 

The equation, f 2 = (wi 2 /«i)*i> emphasizes the fact that the estimated 
value, £ 2 , is not affected by the number of elements in x 2 . This estimated 
value depends entirely upon two quantities: (i) the fraction of the ele- 
ments transferred from * 1 , and (ii) the value of the sum, xj. Neither the 
number of elements in x 2 , nor their sum, influences f 2 - The values, x$ 
and i 2 are correlated (section 7.7) only because they contain the nn 
common elements. 

7.10 — Correlation and differences. Any set of paired observations 
might be interesting from either or both of two viewpoints; correlation- 
regression as in chapters 6 and 7, or differences as in chapters 2, 3, and 4. 
Actually, the two viewpoints are not without connections, some of which 
we wish to examine now. 

In section 3.9 you learned about differences between randomly paired 
observations from a normal population with V = 100. Such differences, 
centering on the mean, zero, had double the variance of the items; that is, 
c r D 2 = 2cri 2 . Again, in section 4.2, corresponding facts about differences 
between means were pointed out; the differences clustered around zero 
with variance, 2<ri 2 / n. Now it is necessary to consider differences of a 
more general type. In the difference, X D = X\ — A r 2 , imagine X\ drawn 
from a population with mean, mj, and variance, off 2 , while A r 2 is from m 2 , 
a 2 2 . You could draw samples from this population of differences by 
taking X\ from the pig gams of table 3.1 and X 2 from those of table 3.11, 
deducting for convenience 10 pounds from each of the latter items so as 
to make m 2 — 20 pounds. Such differences would be normally distributed 
with 

m D — ?rii — m 2 = 30 — 20 = 10 pounds 
a D 2 = ai 2 + <r 2 2 = 100 + 25 - 125 

Three samples of this kind are recorded in table 7.6 (the random 
numbers are the same as those used in table 3.9). Of course, there is 
sampling variation in both mean and variance as can be seen by com- 
parison of each with its parameter indicated above. This sampling 
variation leads to a contrast between the relationship among the sums of 
squares in the sample and in the population. In the sample, it is easily 
shown (example 7.24) that 

Sx D 2 = 5V + SX 2 2 - 2SxiXt 

Illustrating with sample 1, 918.4 — 408.9 + 268.1 — 2( — 120.7), and 
similarly in the other samples. Sample 3 was chosen so that .Sxi* 2 = 0, 
and therefore 1,352 = 1,206 + 146. 
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If we substitute for *$* 1*2 its equal, rn\/(Sxi r )(Sxt i )y then divide both 
members of the equation by n — 1 , there results, 

s D 2 = J 1 2 + H 2 - ZrnSiSt 

These equations hold exactly for every set of differences irrespective of 
sampling variation. Why doesn’t the last term appear in the population 
formula for cr D 2 ? For this reason: on account of the random pairing of 
X\ and A’ 2 , the correlation in the population of such pairs is zero. Through 
vagaries of random drawing, a small correlation usually shows up in 
samples, but the sign is as likely to be plus as minus and the sum of a lot 
of such correlations would tend towards zero. 

If you compare the r’s of samples 1 and 2 with those of table 7.3, you 
will see that the calculated values may well be samples from p = 0. 
Naturally, if you continue the sampling, you will get \r\ >0.632 about 
once in each 20 trials. 

The foregoing discussion applies to differences from uncorrelatcd 
populations. What happens if X\ and A ’2 arc drawn from a population in 
which there is a correlation, p? Such a population could result from the 
common elements of section 7.5, and samples can be drawn from it as 
illustrated in table 7.2: that population, you may recall, had a correlation, 
P12 = 0.67. If the difference, X D = A'i — A r 2 , is calculated from each 
pair, then 

od ~ <ri 2 + cr2 2 — 2pi 2 0-i<72, 

a formula that follows the pattern of that for sample differences from un- 
correlated populations. So far as samples are concerned there is no 
distinction between the formulas. In samples of differences from both 
correlated and noncorrelated populations, 

s D 2 = si 2 + s>z 2 — 2r 12 S 1 S 2 

Of course, if the sample is from a correlated population you may expect r 
to be significant, especially if p and n are fairly large. 

This formula for the variance of sample differences explains the effects 
of arbitrary pairing, examined in section 4.7. The randomly paired 
differences in the first two columns of table 3.3 are 1, 22, 23, — 1, 20, 
33, —41, —20, 20, and 6: their variance is, s D 2 = 513.79. In the first 
column of table 7.7 this variance is verified from the formula above. 
You will notice that the correlation is negative; hence, the third term in 
the formula is added to the other two to make up the total, s D 2 = 513.79. 
Now consider the arbitrary pairing of method I, table 4.8. This arrange- 
ment changes the random correlation of —0.59962 to the high positive 
value, 0.97664, with the consequent reduction in variance to 8.01 (middle 
column, table 7.7). The pairing of method II, on the contrary, accentu- 
ates the random negative correlation and so increases the variance to 
s D 2 = 627.35. 

The success of pairing in an experiment depends upon the extent to 
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TABLE 7.7 

Illustrations of the Effect of Correlation on the Standard Deviation of 
Differences. Data From Tables 3.3 and 4.8 



From the First 

Two Columns of 
Table 3.3 

Method I, 

Table 4.8 

Method II, 

Table 4.8 

Tn 

- 0.59962* 

0 97664* 

- 0 95353* 

Sl* 

169 82 



Si 2 

151 57 



j-j* -f j 2 2 

321 39 

321 39 

321 39 

2rnS\S2 

-192 40 

313 38 

-305 96 

SD 2 

513 79 

8 01 

627 35 


* This number of decimals is necessary to verify the arithmetic. 


which positive correlation can he introduced. If the investigator knows 
his material well enough to bring together into each pair individuals 
that will react alike, he will tend to get differences like those of method I 
with a small resultant variation. But if he should try pairing without 
adequate knowledge of his material, he might get a negative correlation 
with the large experimental error of method II. In that case, he would 
have been better off with the group comparison of chapter 4. 

Occasionally it may be useful to know that the correlation in a sample 
of pairs can be calculated from the foregoing formula by rearranging it 
thus: 

si 1 + r 2 2 - s D 2 _ Sx i 2 + Sx 2 2 - Sx D 2 
f12 2sist ~ 2yA&ci*)(SxJ) 

where x D = xi — * 2 , the deviation of X D from its mean. An example 
may be found in the tobacco virus data of table 2.2 where Sx D 2 ~ 130 is 
already known. The sums of squares of the numbers of lesions on the two 
sets of half leaves are easily computed: Sx\ 2 = 468 and »S* 2 2 = 172. 
Substituting, 

468 + 172 - 130 

r 12 = _ - 0.90 

2\/ (468) (1 72) 

The efficiency of this experiment, discussed in section 3.10, arises from the 
similarity of the reaction of the leaf halves to the inoculation, a similarity 
reflected in this high correlation, 0.90. 

Although the chief interest here is in differences, it is useful to observe, 
also, the formula for the variance of the sum, X s = Xi + AV 

s s 2 = *i 2 + J 2 2 + 2ri 2 JV 2 

EXAMPLE 7.19— In table 7 1, subtract each sister height from her brother’s, 
then compute the corrected sum of squares of the differences. From this, together with 
Sx i* «* 74 and 5*2* *= 66, verify rn — 0.558 by means of the formula just above. 
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EXAMPLE 7.20 — Notice that, in the last formula given above, if j£> a * jp + 
as in sample 3, table 7.6, then n* = 0; if sd 2 < fi 2 + * 2 2 as in sample 2, then n* is 
positive. 

EXAMPLE 7.21 — Haber (11) furnished the following information about 300 ears 
of sweet corn: 



Mean 

Standard Deviation 

Character 

(grams) 

(grams) 

Ear weight, E 

106 55 

27 68 

Cob weight, C 

15 59 


Grain weight, G 

90 96 

24 62 


Observe that £ = C -f G; also, r eQ = 0 9940. Calculate s e ~ 4 19 grams, and r ea 
- 0.6906. 


EXAMPLE 7.22 — If m = 1, show that so — s\ — s*. 

EXAMPLE 7.23 — If m *» —1, show that so = s\ -f~ -f 2 . 

EXAMPLE 7.24 — Assuming successively that the correlation between X\ and Xz 
is first zero then r 12 , derive the corresponding formulas for Sxo* and $xs 2 . 

7.11 — Correlations with sums and ratios. Correlations due to 
common causes. It is often informative to correlate a variate, A3, with 
the sum of itself and another variate, A r 2 ; that is, X\ with X s = X\ + AY 
As an example, there is the correlation, 0.968 (cited in section 7 3) be- 
tween warm dressed weight, A3, and live weight, X s , in swine, the variate 
A3* denoting the weight of discarded portions. Again, one may gain 
information from correlating A3 with X P — X\/X 2 or with X Q ~ XJ 3\3(19). 
An example of r\ P is the correlation between the standard deviation and 
the coefficient of variation; that is, between s and s/x. For illustration, 
50 pairs of these statistics were taken at random from the samples of pig 
gains, section 3.3. The mean of the 50 standard deviations was 9.87, that 
of the means, 29.76. The correlation between s and x was 0.140, an esti- 
mate of p = 0, because these two statistics are theoretically uncorrelated 
in this kind of sampling. But the correlation between s and C was 0.932. 
A familiar instance of r iQ is the correlation of live weight, A3, with dressing 
percent, X Q : 

carcass weight, A3 
v live weight, A3 

An even more complicated relation is found in the correlation, —0.68, 
between average daily gain of swine and feed required per pound gained 
(6), where the gain is in the numerator of the first rate and in the de- 
nominator of the second. 

Having observed some unwarranted interpretations of such correla- 
tions, Karl Pearson dubbed them “spurious” (17), and this rather deroga- 
tory title has led people to distrust them. Of course, it is the interpretation 
that may be spurious. The correlations are on the same footing as any 
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others: they are appropriate estimates of parameters if they satisfy the 
conditions laid down in section 7.6 (1). As for the interpretation, any 
explanation of a correlation between X\ and the sums or ratios involving 
it would seem to require knowledge of r 12 as well. 

An especially simple case, in which r 12 = 0, is illustrated in table 7.8. 

TABLE 7.8 

Correlations of A, With X 8 « X x -f- A 2 and X q - X 2 /X x . and X% Comprise 
Sample 3 in Table 7.6 Where r n = 0 



X\ 

X % 

! X\ 

A* 

X\ 

Xq 


32 

18 

32 

50 

32 

0 562 


31 

13 

31 

44 

31 

0.419 


28 

22 

28 

50 

28 

0 786 


24 

17 

24 

41 

24 

0 708 


44 

11 

44 

55 

44 

0 250 


53 

19 

53 

72 j 

53 

0 358 


9 

16 

9 

25 

9 

1 778 


35 

23 

35 

58 

35 

0 657 


33 

23 

33 

56 

33 

0 697 


31 

18 

31 

49 

31 

0 581 

Sum 

320 

180 

320 

500 

320 

6 796 

Sum of squares 

1,206 

146 

1,206 

1,352 

1,206 

1 59887 

Sum of products 


0 

1,206 

— 37 

516 

Correlation 


0 

0 9445 

- 0 

8543 


The correlation, ri s = 0.9445, is like the one between dressed wieght (Ah) 
and live weight (Ah*) except that here there is no correlation between the 
two parts. It would be interesting to know the correlation between dressed 
weight and the weight of discarded parts. 

The part-whole correlation can be computed from the original samples 
by means of the formula, 

^1 + ^ 12^2 

\Ai 2 + 2r i2i* i^2 + s 2 2 
If ri 2 = 0, as in table 7.8, this formula reduces to 


vV + * 2 2 \/l + S 2 2 /sS \/\ + Sxf/Sx I 2 

The calculation of the part-whole correlation in the table may now be 
verified by substitution from the first sample: 


ri 8 = 


1 

vT+ 14671206 


0.9445, as before 


If, in addition to ri 2 = 0, ^ also, then 


ris = l/y/2 = 0.707 
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This gives some idea of the size of spurious correlations in special cases and 
shows that both the standard deviations are involved, as well as the corre- 
lation between the added variates. 

This part-whole correlation may be considered a special case of com- 
mon elements. A\ is common to both X\ and X s , while X 2 is the different 
part. If X\ and X 2 are drawn from the same normal population, then one 
may put n\ 2 — 1, n\ = 1, n 2 = 2, and each sample correlation is an esti- 
mate of 

p = rii 2 / V nin 2 = l/\/2 = 0.707 

as in the special case of ri 5 above. • 

The correlation, r\ Q = —0.8543, is negative partly because X\ is the 
denominator of the fraction, X Q , so that a large X\ tends to make a small 
Xq. This correlation, also, can be got directly from the sample. The 
formula is an approximate one: 

_ r\ 2 C 2 — Ci 

\/ C 1 2 + 2ri2CiC2 + C 2 2 

where C is the coefficient of variation, s/x> expressed as a fraction (not 
percentage). In the first pair of columns in table 7.8, ri 2 = 0, Ci 2 = 0.1 3090, 
C 2 2 = 0.05007 and Ci = 0.3618. Substituting, 


tiq = 


-0.3618 

V Oif 5 ~OO^^oT30 90 


-0 8505 


The small discrepancy is due partly to the approximate nature of the 
formula. In this ratio correlation, the means as well as the standard 
deviations of the original sample have come into the picture. 

The correlation between two variables may be due, wholly or in part, 
to their common relation to one or more other factors. The organic 
correlations already mentioned are examples. A big animal tends to be 
big all over, so that any two parts arc likely correlated because of their 
participation in the general size. Again, two quantities that change in 
time may show a high correlation. As an example, there is a correlation 
of —0.98 between the birth rate in Great Britain, from 1875 to 1920, 
and the production of pig iron in the United States. The matter was 
discussed by Yule as a question, “Why do we sometimes get nonsense- 
correlations between time series (21)?” Among the answers suggested 
only one is pertinent at this point: pairs of observations ordered in time 
are not randomly drawn in the usual sense and may not come from any 
normal bivariate population. Hence, they may not conform to the 
conditions we have set for estimates and probability statements. For 
another answer, see “partial correlation,” chapter 13. 


7.12 — Correlation of ranks. Occasionally individuals are not meas- 
ured, or cannot be, but are only arranged in order according to some 
criterion. Instead of a measurement, each object has a rank — 1st, 2nd, 
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etc. Two such rankings of the same things, or of related things, led Spear- 
man ( 20 ) to devise a formula for rank correlation, 

6 StP 

r s = 1 ~ -71 7 , 

n{n l — 1 ) 

whose calculation is explained in table 7.9. Like r, the rank correlation 


TABLE 7.9 

Ranking of 7 Rats bv 2 Observers of Their Condition After 3 Weeks 
on a Deficient Diet 


Rat 

Number 

Ranking by 

Difference, 

d 

cP 

Observer 1 

Observer 2 

1 

4 

4 

0 

0 

2 

1 

2 

-1 

1 

3 

6 

5 

1 

1 

4 

5 

6 

-1 

1 

5 

3 

1 

2 

4 

6 

2 

3 

-1 

1 

7 

7 

7 

0 

0 

'T, 

Sd =-- 0 

QO 

I! 

5 


6.SV/ 2 _ _ 6X8 

*(w r - 1) 7(49 ~ 1) 


= 0 857 


can range in samples from —1 (complete discordance) to +1 (.complete 
concordance). Calculation of r s in small samples is much easier than that 
of the product-moment correlation, r. 

The utility of rank correlation seems rather limited. Most judges of 
pigs or bathing beauties feel safer with a score card on which various 
numbers of points are assigned to each quality considered. Total scores 
by two or more judges are ordinarily related by use of r. But if there are 
two rankings to be correlated, Spearman’s formula is appropriate. 

Little is known about the theory of rank correlation except in two 
fields: (A) the population in which p = 0, the two rankings being inde- 
pendent; and (B) the normal bivariate population in which the two 
coefficients are closely related. References (12), (13), and (14) are 
excellent accounts of recent thinking about the problem. 

A. The case of independence. Under the hypothesis, p = 0, samples of 
r s are distributed in a symmetrical fashion that approaches the normal as 
n increases. The test of significance then involves no assumption about 
the form of the population. For samples greater than 8 ranks, the test is 
the same as that of section 7.6, table 7,3, but if n is 8 or less, use table 7.10. 
One observes that significant evidence against the hypothesis cannot be 
found in samples of 4 or less. The correlation of table 7.9 is significant at 
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TABLE 7.10 

Significant Levels in Small Samples of the Rank Correlation Coefficient 
Under the Hypothesis, p * 0 



Size of Sample 

5% Level 

1% Level 


4 or less 

none 

none 


5 

1 

none 


6 

0 886 

1 


7 

0 750 

0.893 


8 

0 714 

0 857 


9 or more 

use table 7.3 



the 5% level but not at the 1%; that is, there are about 2 chances in 100 
of getting a rank correlation greater than 0.857 even if the rats were all 
in the same condition so that the judgments of the observers had no 
foundation. 

B. Samples from a normal bivariate population. In this population, p and p s 
are almost the same, the latter being slightly smaller except for perfect 
or no correlation, in which cases p = p B . As for samples of r s from this 
population, little is known about them unless p = 0. Using common 
elements (section 7.5), I drew 10 samples, each with n = 20, from a 
normal bivariate population with 

n 12 10 

p - — = - — = 0.559, 

y/nin 2 \/(20)(16) 

then calculated r and r s in each. The results are tabulated in table 7.11, 

TABLE 7.11 

Product-Moment and Rank Correlations in 10 Samples From a Normal Bivariate 
Population in Which p «= 0.559 


Sample 

r 

r» 

Sample 

r 

r. 

1 

0 663 

0.505 

6 

0 712 

0 680 

2 

.708 

.738 

7 

627 

474 

3 

505 

.584 

8 

444 

352 

4 

374 

.311 

9 

529 

398 

5 

.536 

.562 

10 

546 

484 

Average 




0.564 

0 512 


the averages being got by use of z (section 7.6). In this small sample of 
coefficients, r s averages a little smaller than r and is somewhat more 
variable. The correlation between the 10 pairs of r and r s is 0.64. There 
is no strong evidence here that one would not be safe in converting his 
measurement data to ranks and computing r s instead of r. For samples of 
40 or less, r s is easier of calculation than is r. However, with good data 
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from approximately normal populations I would spend the extra time it 
takes to get the product-moment correlation whose distribution is exactly 
known. 

But suppose one has a sample from a population whose distribution is 
unknown or is known to be anormal. Then there is some reason to believe 
that he might be better off with r s than with r. My advice would be to 
seek the aid of a mathematical statistician in the hope that a more appro- 
priate method might be found. 

7.13 — Summary. The correlation coefficient is a measure of co- 
variation — the degree to which two variates keep in step as they change. 
The coefficient varies from —1 to +1, the extremes indicating perfect 
linear relationship and zero, independence. The statistic, r, is an estimate 
of the parameter, p, if the sample is drawn randomly from a normal bi- 
variate population: there must be no selection of either variate. Errors 
in measuring either X\ or X 2 decrease the precision of the estimate. 

The hypothesis, p == 0, is tested by 

t = r\f(n ~ 2)/(l - r 2 ),' df. = n - 2 

1 + r 

The transformation, z = 1.1513 log , almost normally distributed 

1 — r 

with variance, 1 / (n — 3), is used for the following: (i) tests of values of p 
different from zero, (ii) fiducial limits, (iii) test of hypothesis that 2 or 
more samples of r are from the same normal bivariate population, and 
(iv) averaging two or more values of r on the assumption that they are 
samples from a common population. 

The fraction, r, is often useful in regression calculations even in cir- 
cumstances where it may not be an estimate of any parameter. 

Correlation may be explained variously: by inheritance, by common 
elements, by organic relations, by the association of two variates with a 
correlated third variate such as time (nonsense correlation), and by the 
inherent relationship of sums or ratios (spurious correlation). 

SSd^ 

Spearman’s rank correlation, given by r s = 1 , is available 

rr — n 

if only rankings are given. It may also be used if there is evidence that 
the population sampled is anormal. 

Intraclass correlation will be presented in chapter 10 and partial corre- 
lation in chapter 13. 
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Chapter 8 

LARGE SAMPLE METHODS 


8.1 — Introduction. The theory of statistics grew up with the notion 
of a normal frequency distribution. Most of the more fundamental con- 
cepts start with the mathematical statement of that same distribution. 
Its name has been mentioned more than once in this text, but no very 
definite description of it has been given. In this chapter will be set out 
so much of the theory as is necessary for the understanding of elementary 
statistics in biology. The applications are rare but occasionally important. 
But before lookiug at the normal distribution in particular, we shall 
describe some methods of computation appropriate in large samples. 

8.2 — Computation of mean and standard deviation of a frequency 
distribution. In section 3.4 you were promised a short method for com- 
puting the statistics of a frequency distribution. That of the 511 means is 
copied in the first two columns of table 8.1. The actual sample means 
arc grouped into classes with midpoints at 19, 20, . . . pounds, the number 
of means in each class being recorded in the second column. For con- 
venience, the arbitrary origin, G , is selected as 29 pounds because that is 
the middle class mark and also because that class carries one of the 
greater frequencies. The code numbers are simply the deviations of the 
class marks from G. In the fourth column are recorded the sums of the 
deviations in the several classes. For example, in the fourth class, class 
mark 22, the deviation is —7 and there are 7 of them: the sum is there- 
fore ( — 7) (7) — —49. At the foot of the column are set down separately 
the sums of the negative and positive deviations together with their total, 
446. 

The sums of the squares of the deviations in the last column are calcu- 
lated conveniently by multiplying/A' by X. Again using class mark 22 as 
an example, ( — 7) ( — 49) = 343 is easier than ( — 7) 2 (7). Merely multiply 
twice by each code number, first / to get fX, then fX to get fX 2 . The 
calculations at the bottom of the table involve no innovations. 

Do you see the identity of the code numbers, A', with those discussed 
in chapter 5? Each mean weight is coded, implicitly, by first subtracting 
29, then rounding. Consider a mean whose value was calculated in 
section 3.4, such as 33.2 pounds. The code is 33.2 — 29 == 4.2, rounded 
to X = 4. In the actual process of making the frequency distribution the 
rounding, accomplished by grouping, was the first step. For example, the 
mean, 33.2 pounds, fell into the class whose mark was 33 pounds. The 
subtraction of the class mark completed the coding. It is easy to verify 

[1G9] 
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TABLE 8.1 

Frequency Distribution of 511 Means of Samples of 10 Items (Table 3.4). 
Computation of Mean and Standard Deviation 
G » 29 pounds 


Sum of 





Sum of 

Squares of 

Class-mark Frequency 

Code Numbers 

Code Numbers Code Numbers 

Pounds 

/ 

X 

fx 

fX 2 

19 

1 

-10 

-10 

100 

20 

1 

- 9 

- 9 

81 

21 

0 

- 8 

0 

0 

22 

7 

- 7 

-49 

343 

23 

5 

- 6 

-30 

180 

24 

10 

- 5 

-50 

250 

25 

19 

- 4 

-76 

304 

26 

30 

- 3 

-90 

270 

27 

41 

- 2 

-82 

164 

28 

48 

- 1 

-48 

48 

29 

66 

0 

0 

0 

30 

72 

1 

72 

72 

31 

56 

2 

112 

224 

32 

46 

3 

138 

414 

33 

45 

4 

180 

720 

34 

22 

5 

110 

550 

35 

24 

6 

144 

864 

36 

12 

7 

84 

588 

37 

5 

8 

40 

320 

38 

0 

9 

0 

0 

39 

1 

10 

10 

100 


n » SJ = 511 


-444 6/A’ 2 = 

= 5,592 




890 





446 


SfX « 

446 " 


SfX* - 

5,592 

SJX/n « 

446/511 

{SfX) 1 / 

n » (446)7511 = 

389 27 

SB 

0 87 


6* 2 = 

5,202.73 

£ « 

G + SfX/n 

s 1 

— Sx 7 (n — 1 ) =* 

5,202 73/510 

= 

29 -f- 0.87 = 29.87 pounds 

- 

10.2014 

! \A 2 /rc « 

V'10.2014 ’511 = 0.141 pound 

S as 

3.194 pounds 


the code numbers for such means as 31.6, 25.1, and 26.8 pounds; namely, 
3, — 4, and —2. How about 30.5? You remember the rule — round to the 
adjacent even number. Hence 30.5 — 29 = 1.5 rounds to 2. But 
31.5 — 29 = 2.5 also rounds to 2. The result is a slight excess of fre- 
quency in the even-numbered classes. In effect, these classes are wider 
than the others. 

Another rule must be brought forward from chapter 5, the one re- 
quiring the range of code numbers to be not less than 20. In new terms, 
there must be at least 20 classes for precise work, preferably more. After 
the computations are done, you can apply this criterion: the standard 
deviation should be at least four times as great as the class interval. You 
will notice that this requirement was not fulfilled in table 8.1, s being 
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only a little more than three times the class interval of one pound. It 
would have been better to make the class interval one-half or three- 
quarters of a pound. 

It is well to keep in mind that the only lack of precision in using code 
numbers derives from the grouping or rounding. This process is usually 
limited to that type of variate known as continuous, an item being likely 
to fall at any point on the scale. There is no sacrifice of precision in the 
computations when the variate results from such an enumeration as that 
of the following example. 

EXAMPLE 8.1 — Edwards (3) planted 20 soybeans on each of 183 agar plates, then 
observed the numbers germinating. The frequency distribution follows: 


Number germinating 
on each plate, X 

6 7 8 9 10 11 12 13 14 15 16 

Number of plates,/ 

5 9 8 19 26 34 26 22 21 10 3 


Compute the mean number of beans germinating on a plate, 1 1 2, together with the 
standard error, 0.17 bean. Is any inaccuracy introduced by the use of the arbitrary 
origin' 5 Notice the use of enumeration data as though they were measurements. This 
is often convenient. If the distribution is normal, one may safely make all inferences 
appropriate to normal measurement data For the test of normality, gee section 8.6. 


8.3 — Computation of mean and standard deviation of a frequency 
distribution — continued. In the last section the unit of measurement, 
one pound, was conveniently used as the class interval, but in the distri- 
bution of table 8.2 it takes an interval of 10 pounds to provide a satis- 
factory number of classes. The first class interval extends from 73 pounds 
up to (but not including) 85 pounds, the midpoint being 80 pounds. 


EXAMPLE 8.2 — Compute the mean and standard deviation of the live weights 
in table 8.2, taking G = 1 80 pounds. 

EXAMPLE 8.3 — The yields in grams of 1,499 rows of wheat arc recorded by Wiebe 
(10). They have been tabulated as follows: 


Class 

mark 

Fre- 

quency 

Class 

mark 

Fre- 

quency 

Class 

mark 

Fre- 

quency 

375 

3 

600 

127 

825 

10 

400 

13 

625 

140 

850 

10 

425 

41 

650 

122 

875 

4 

450 

99 

675 

94 

900 

4 

475 

97 

700 

64 

925 

2 

500 

118 

725 

49 

950 

3 

525 

138 

750 

31 

975 

1 

550 

146 

775 

26 

1,000 

1 

575 

136 

800 

20 

Total 

1,499 


Compute f * 587.74 grains, and s =* 100.55 grams. Are there enough classes in this 
distribution? 
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EXAMPLE 8.4 — Linds trom (5) reported the numbers of rows of kernels on 327 
ears of the F% generation derived from a cross of inbred lines of maize : 


Number of rows of kernels 

12 

14 

16 

18 

20 

22 

Number of cars 

25 

75 

133 

68 

21 

5 


The mean is 16.00 rows, the standard deviation 2.136 rows. Is any inaccuracy intro- 
duced by the small number of classes? 


TABLE 8.2 

Frequency Distribution of Live Weights of 533 Swine. Computation of Mean 
and Standard Deviation With a Class Interval Different From One Unit 


Class interval, 7=10 pounds 

G = 170 pounds 





Sum of 


Class-mark 

Frequency 

Code Nurrtbers 

Code Numbers 

Squares 

Pounds 

j 

A' 

fX 

/.V 2 

80 

1 

-9 

- 9 

81 

90 

0 

— 8 

0 

0 

100 

0 

-7 

0 

0 

110 

7 

— 6 

-42 

252 

120 

18 

-5 

-90 

450 

130 

21 

-4 

-84 

336 

140 

22 

— 3 

-66 

198 

150 

44 

— 2 

-88 

176 

160 

67 

-1 

-67 

67 

170 

76 

0 

0 

0 

180 

55 

1 

55 

55 

190 

57 

2 

114 

228 

200 

47 

3 

141 

423 

210 

33 

4 

132 

528 

220 

30 

5 

150 

750 

230 

23 

6 

138 

828 

240 

11 

7 

77 

539 

250 

5 

8 

40 

320 

260 

5 

9 

45 

405 

270 

4 

10 

40 

400 

280 

5 

11 

55 

605 

290 

2 

12 

24 

288 

n 

= 533 


-446 SJX 2 = 

6,929 


1,011 
SJX « 565 


SfX « 565 

I(SfX)/n « 10(565/533) 

= 11 pounds 
* - O + I{SJX)/n 
- 170 -f 11 
=181 pounds 

» VL189 86/533 


SfX* = 6,929 

{SJX)*/n - (565) a /533 - 598 92 

For code numbers, Sx* = 6,330.08 

* 2 = I*{Sx*)/(n - 1 ) 

= (10) 2 (6,330.08)/532 
= 1,189.86 


= 1 .49 pounds s = 34 5 pounds 
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G was selected as the midpoint of the class of greatest frequency despite 
the fact that this was not the middle class. (The size of the correction for 
mean,. 11 pounds, shows that G might well have been taken as 180 pounds.) 
The class interval, 7=10 pounds, enters the calculations of both mean 
and variance, since both must be decoded. 

You will observe that we still haven’t provided enough classes. The 
standard deviation falls short of being four times the class interval, but 
the discrepancy is not serious. 


8.4 — Miscellaneous remarks about frequency distributions. There 
exists some confusion about the desirable number of classes for a fre- 
quency distribution. One thinks of two distinct objectives: (i) the presen- 
tation of data in summary form, accompanied often by a graph, and (ii) 
the calculation of the sample statistics. It is only for the latter that 20 
or more classes are required. For summarization it is usually desirable 
to keep the number of classes less than 10. 

It requires care to get the class mark in the center of its interval. 
Suppose weights are recorded in whole pounds with no fractional values. 
It is the usual practice to estimate such weights to at least the closest 
half pound; that is, the record of 25 pounds probably included all weights 
between 24 and 2 5 pounds. If the class intervals are chosen as 20-29 
pounds, 30-39 pounds, etc., the first class really extends from 19.5 to 

29.5 pounds. In such a scheme, the class mark is (19.5 + 29.5)/ 2 = 

24.5 pounds, not 25. Fix the class limits as precisely as feasible, then 
compute the midpoint to get the class mark. 

It is assumed that the items falling into a class are somewhat evenly 
distributed throughout the interval. When this expectation is realized 
the mean of the items will be not greatly different from the class mark. 
This is especially likely in the high frequency intervals bearing the greatest 
weights in the computations. Caution should be exercised, then, if there 
are natural groupings in the interval. An instance was observed where 
the number of locules (seed compartments) in tomatoes was the variate, 
its values being confined to whole numbers and halves. Only a few of the 
fruits had half locules. At first, the class intervals were chosen to extend 
from 2 up to but not including 3, etc., the class marks falling at 2^, 3 Y% 
etc. Actually, the mean numbers of locules in the classes were almost at 
the lower boundaries, 2, 3, etc. This systematic error led to an over- 
statement of almost half a locule in the mean. 

Those desiring to refine the estimate of standard deviation of a con- 
tinuous variate may use Sheppard’s correction for grouping (9). This 
consists in deducting onc-twelfth of a unit from the coded variance, 
provided the code numbers differ by unity as in tables 8.1 and 8.2. In 
table 8.1 the correction would be 10.2014 — 0.0833 = 10.1181 with 
s = 3.181 pounds. This estimate, however, should not be used in a test 
of significance. 
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EXAMPLE 8.5 — In table 8.2, apply Sheppard’s correction to derive the estimate, 
/ ■* 34,37 pounds. Before correction, the coded variance is 11.8986. 

EXAMPLE 8.6 — Compute the corrected value of s , 100.55 grams, in example 8.3. 

EXAMPLE 8.7 — In examples 8.1 and 8.4, whose variables are discrete, is there 
any error of grouping to be corrected? 

EXAMPLE 8.8 — Two lots of mice inoculated with different isolations of typhoid 
(see section 11.10 for reference) survived during the numbers of days indicated by the 
following frequency distributions. Test the significance of the difference between the 
means of days to death (chapter 4). The sums of squares are 316.88 and 209 64; the 


Days to death 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

Strain 9D 

7 

5 

2 

16 

11 

7 

2 

5 

2 

2 

0 

0 

Strain DSC 1 

1 

0 

0 

1 

6 

8 

4 

6 

3 

3 

1 

3 


difference between means, 2.628 days, t «= 5.22, and d.f. = 93. 


8.5 — The test of symmetry. At various points in this text where you 
have been asked to judge of the normality of sets of data the only criteria 
available were symmetry and the proportion of items in the interval 
x =*= s. Adequate tests of normality will be described in this and following 
sections (4). We shall consider two types of departure from the normal. 
In one, the distribution of the data is asymmetrical or skewed, the mean 
and median • being different. The other occurs in symmetrical sets, 
characterized by either an excess or a deficit of items concentrated near 
the center of the range. For the sake of simplicity in presentation, we 
shall first illustrate the test for symmetry only, using a small sample. 

In section 2.17 the array of weights of 11 men was cited as an example 
of an asymmetrical set. Let us measure the skewness and test significance. 
The calculations are set out in table 8.3. The first unfamiliar feature 
is the column of cubes with their sum, Sz — 248,628. Next appear the two 
k statistics, kt being our old friend variance, and k% a corresponding aver- 
age of the third powers of the deviations from mean. The measure of 
skewness is gi — 1.96. You will see immediately that Sz and therefore gi 
might be positive, negative or zero. If gi were zero, symmetry in the 
sample would be demonstrated. A positive gi, as in our illustration, 
indicates an excess in the number of items smaller than the mean. If gi 
proves to be significantly different from zero, the excess of small values 
may then be ascribed to asymmetry in the population from which the 
sample is drawn. The standard error of gi turns out to be 0.661, so that 
t = 2.96. The test of significance is carried out with an infinite number 
of degrees of freedom, the 1% level of t being 2.576. 

To assist you in following the computations we have carried an un- 
necessary number of significant figures in table 8.3. For practice, you 
might recalculate the statistics with the use of no more than four significant 
figures in any one number. 

As a second illustration, let us test the skewness of the weights of 
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TABLE 8.3 

The Test of Symmetry in a Small Sample 
Weights (pounds) of 1 1 men 


Weight Deviation Square Cube 

X x * 2 ** 


148 


-24 

576 

-13,824 

154 


-18 

324 

- 5,832 

158 


-14 

196 

- 2,744 

160 


-12 

144 

- 1,728 

161 


-11 

121 

- 1,331 

162 


-10 

100 

- 1,000 

166 


- 6 

36 

- 216 

170 


- 2 

4 

8 

182 


10 

100 

1,000 

195 


23 

529 

12,167 

236 


64 

4,096 

262,144 

x s* 172 

.V 

i - 0 

St - 6,226 

S, - 248,628 

n - 11 

kt - Si/(n 

- 1) 

1 

<o 

« 

II 

--7 

- 2) 


- 6,226/10 

« (11)(248,628)/(10)(9) 


- 622 6 


- 2,734,908/90 

- 30,387 9 


g\ “ KJ \Zit 2 8 * k i /(k i )(\/k^j s 0 j 2 = 6 n{n - 1 )/{n - 2 ){n -f 1)(« -f- 3) 

« 30,387 9/(622.6) (V622 6J - (6)(1 1)(10)/(9)(12)(14) 

- 30,387.9/15,535 - 1 96 - 660/1,512 «■ 0 437 

s oi — 0 661, t = g\/s a \ = 1.96/0 661 — 2.96, df ** » 


neutral fats in the blood plasma of normal men, using the data recorded 
in example 2.43 These data call for the introduction of the machine 
method of calculation. Accumulate the sum of the 64 weights, = 14,361; 
the sum of their squares, S 2 = 4,318,741; and the sum of cubes, s$ = 
1,878,426,855. The last sum is obtained most easily by taking the cubes 
from one of the many available tables containing the first three powers of 
the integers (1) (2) (6). 

From s 2 you compute in the usual manner the sum of the squares of 
deviations from mean, 

St = s 2 - si*/n = 4,518,741 - (14,361) 2 /64 = 1,296,267, 

si being the sum of the A^s. For S 3 , the sum of the cubes of deviations 
from the mean, the corresponding formula is 

~ Ss — 3sis 2 /n + 2s\ l /n 2 

= 1,878,426,855 - 3(14,361) (4,51 8,741 )/64 + 2(14,361)V(64) 2 

- 282,723,308 

Armed with these results, you complete the computations indicated 
in table 8.3, ending with g! = 1.5696 and t = 5.24. Certainly, there is 
no doubt of asymmetry in the population from which this sample was 
drawn. The positive g i indicates a mean greater than the median. 
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EXAMPLE 8.9 — Use the machine method for calculating Si and S t in tabic 8.3. 

EXAMPLE 8.10 — In example 2.13, the value of^i =*= S „ i for the 20 guinea pigs is 
0.15 0.51. The value of g\ for example 2.39 is zero. Computing the statistics for 

one or both these sets will afford plenty of practice. 

8.6 — The tests of normality in large samples. One can test both 
skewness and kurtosis , two ways in which distributions may depart from 
normal. Kurtosis is measured by g 2 , a statistic based on the sum of the 
fourth powers of deviations from mean. If g$ is zero, there is no departure 
from normality so far as this measure is concerned. A positive value of 
gi indicates an excess of items near the mean and far from it, with a 
corresponding depletion of the flanks of the distribution. This is the man- 
ner in which the distribution of t departs from normal. Negative values 
of g 2 result from flat-topped distribution curves. 

The computations are shown in table 8.4. The first five columns 

TABLE 8.4 

Test of Normality in a Frequency Distribution 
Diameter (mm.) of ears in F 2 generation of yellow dent inbreds 
of maize (Lindstrom) 


Fre- 


Class 

mark 

quency 

/ 

Code 

A' 

/T 

A’ 

/X* 


36 

1 

-8 

- 8 

64 

- 512 

4,096 

37 

0 

-7 

0 

0 

0 

0 

38 

4 

-6 

-24 

144 

- 864 

5,184 

39 

7 

-5 

-35 

175 

- 875 

4,375 

40 

18 

-4 

-72 

288 

-1,152 

4 , 608 

41 

26 

-3 

-78 

234 

- 702 

2,106 

42 

28 

-2 

-56 

112 

- 224 

448 

43 

51 

-1 

-51 

51 

- 51 

51 

44 

47 

0 

0 

0 

0 

0 

45 

49 

1 

49 

49 

49 

49 

46 

31 

2 

62 

124 

248 

496 

47 

33 

3 

99 

297 

891 

2,673 

48 

19 

4 

76 

304 

1,216 

4,864 

49 

4 

5 

20 

100 

500 

2,500 

50 

8 

6 

48 

288 

1,728 

10,368 

51 

1 

7 

7 

49 

343 

2,401 


327 

n 

* 37 j. 

» 2,279 

s* - 595 s t 

» 44,219 


S 2 = si - s\*/n = 2,274.81 

S, * j a ~ 3W« + 2ji 8 /n 2 - -177.659 

Si — si — As\Sz/n T bsihi/n 2 — 3n 4 /« a — 44,124 6 


= Sl /n = 0 1131 
hi « S 2 /(n - 1) - 6.9779 
k, - nSi/(n - l)(n - 2) - -0 5483 

ki « n[(« + 1 )S 4 - 3 (n - l)W/»l/(« - 1)(» ~ 2)(n - 3) = -9.563 


gi = k t /(k 2 y/k 2 ) - -0 0297 g 2 « kjkj « - 0.1964 

j,i* - 6n(rt - 1 )/(n - 2 )(n + 1 )(n + 3) - 0 0182 
v - 24 n(n - 1 ) s /(« 7 3 )(n - 2)(» + 3) (a + 5) - 0.0723 
Standard deviations: of £ 1 , 0.135; of g 2y 0.269 
Values of V for g\ , 0.22; for £ 2 , 0.73; dj * « 
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correspond to those of tables 8.1 and 8.2. The numbers in the last four 
columns are calculated by repeated multiplication by X. Take line 3 as 
an example: (4) ( — 6) = —24, ( — 24) ( — 6) = 144, (144)( — 6) = —864, 
and ( — 864) ( — 6) = 5,184. You are already familiar with the remaining 
calculations save those of £4, fa , and s 9 2. The formulas are intricate 
but may be followed with a little care. 

You notice that both g\ and g 2 are small, each being exceeded by its 
standard error. The negative £1 indicates a slight asymmetry with an 
excess of items larger than the mean drawing the peak of the frequency 
curve toward the right. The negative g% shows a moderate kurtosis de- 
scribed by a distribution curve plateau-like near the center with an excess 
of moderate deviations. Since neither g is significant, however, there is 
little evidence of departure from normality in the population. 

EXAMPLE 8.11— Compute the mean ear diameter from table 8.4, together with 
its standard error, 44.11 =±= 0 146 mm. 

EXAMPLE 8 12 — In tabic 3.2, compute gi ** —0.0139 and £2 = 0.0460, showing 
that the distribution is practically normal. 

EXAMPLE 8 13- In table 3.6 is the sampling distribution of 511 standard devia- 
tions Calculate g\ =* 0.3074 with standard error, 0.108. This indicates that the 
anticipated departure from normality is significant. £1 is positive because the mode is 
to the left of the mean. 

EXAMPLE 8 14 — The 511 values of t discussed in section 3.7 were distributed as 
follows 


Class 


Class 


Class 


Class 


mai k 

/ 

mark 

./ 

mark 

/ 

mark 

/ 

-3 13 

3 

-1 13 

29 

0 87 

31 

2 87 

1 

-2 88 

5 

-0 88 

35 

1 12 

23 

3 12 

1 

-2 63 

1 

-0 63 

38 

1 37 

17 

3 37 

2 

- 2 38 

3 

-0 38 

40 

1 62 

11 

3 62 

0 

-2 13 

6 

-0 13 

52 

1 87 

8 

3 87 

0 

-1 88 

12 

0 12 

57 

1 2 12 

10 

4 12 

0 

-1 63 

21 

0 37 

43 

2 37 

6 

4 37 

1 

-1 38 

16 

0 62 

37 

; 2 62 

2 


— 







Total 511 


The highly significant value of g 2 = 0 5340 shows that the frequencies near the mode 
and in the tails are greater than in the normal distribution, those in the flanks being 
less This was expected. But gi = 0 1356 is nonsignificant, which is also expected 
because the theoretical distribution of t is symmetrical. 

8.7 — The graph of the normal curve. This ideal curve follows a 
famous mathematical formula (example 8.25), the graph of which is shown 
in figure 8.1. The abscissa is standard measure, t\ that is, deviation from 
mean expressed in terms of the population standard deviation as a unit. 
The ordinates are taken from table 8.5. As an example, at t = 1 the 
ordinate is 0.2420. The same ordinate is plotted at t — —1. As another 
example, at / = 2.46 the ordinate is read from the column headed 0.06 
in the table, opposite the entry t — 2.4 in the left column, the length 



178 


Statistical Methods 


Section 8.7 



--4 -3 -K. ~l O 1 *2L 3 -4 

STANDARD MEASURE! 

Fig. 8.1— The normal frequency curve. Abscissas are standard measure. Ordinates 
are taken from table 8.5. 


being 0.0194. You will notice that as the curve recedes from the origin it 
descends faster and faster until it reaches the ordinate at t = 1. Beyond 
that point, the descent is ever slower. 

t* It is easy to identify the ordinates in the table with those in such a 
distribution as that of ear diameters in table 8.4. You may have already 
constructed the histogram for the sample. Applying Sheppard’s correc- 
tion to the variance £ 2 in table 8.4, you have s — \//: 2 — 0.0833 = 
v/6.8946 = 2.626 mm. as the suitable estimate of the value of this 
parameter. There remains only a constant multiplier to transform the 
ordinates in table 8.5 to those appropriate for the sample. It is 


n _ 327 
s ~ 2.626 


124.5 


Start with the ordinate at the mean, 44.11 mm. Its value is (124.5) 
(0.3989) = 49.66 ears, the second factor being taken from the table at 
t = 0. This ordinate is plotted on the same scale of frequencies used in 
making your histogram. Next, compute the ordinate at one of the class 
marks, say 39 mm. The standard measure of this point is t = (39 — 44.11)/ 
2.626 = —1.95. The tabular ordinate corresponding is 0.0596; whence 
the ordinate to be plotted is (124.5) (0.0596) = 7.42 ears. As a final 
illustration, compute the ordiante at one of the class boundaries, say 
47.5 mm. You get t = 1.29, and thence 21.61 ears as the ordinate to be 
plotted. Proceeding in this manner, you compute as many ordinates as 
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TABLE 8.5 

Ordinates of the Normal Curve 


Second decimal place in t 
L 



0.00 

0.01 

0.02 

0 03 

0.04 

0.05 

0.06 

0.07 

0.08 

0.09 

0 0 

3989 

.3989 

3989 

.3988 

.3986 

.3984 

.3982 

.3980 

.3977 

.3973 

0.1 

.3970 

.3965 

.3961 

.3956 

.3951 

.3945 

.3939 

.3932 

.3925 

.3918 

0.2 

.3910 

.3902 

.3894 

.3885 

.3876 

.3867 

.3857 

.3847 

.3836 

.3825 

0.3 

.3814 

3802 

.3790 

.3778 

.3765 

.3752 

.3739 

.3725 

.3712 

.3697 

0 4 

.3683 

.3668 

.3653 

.3637 

.3621 

.3605 

.3589 

.3572 

.3555 

.3538 

0.5 

.3521 

.3503 

.3485 

.3467 

.3448 

.3429 

.3410 

.3391 

.3372 

.3352 

0 6 

3332 

.3312 

.3292 

3271 

.3251 

.3230 

.3209 

.3187 

.3166 

.3144 

0 7 

.3123 

.3101 

.3079 

3056 

.3034 

.3011 

.2989 

.2966 

2943 

2920 

0 8 

2897 

2874 

.2850 

.2827 

.2803 

.2780 

.2756 

.2732 

2709 

.2685 

0.9 

.2661 

.2637 

.2613 

2589 

.2565 

.2541 

.2516 

.2492 

.2468 

2444 

1 0 

.2420 

.2396 

.2371 

.2347 

.2323 

.2299 

.2275 

.2251 

2227 

.2203 

1 1 

2179 

2155 

.2131 

2107 

.2083 

.2059 

.2036 

.2012 

.1989 

.1965 

1.2 

1942 

.1919 

.1895 

.1872 

.1849 

.1826 

1804 

.1781 

.1758 

.1736 

1.3 

1714 

.1691 

1669 

.1647 

.1626 

.1604 

1582 

1561 

.1539 

1518 

1 4 

.1497 

.1476 

1456 

.1435 

.1415 

.1394 

.1374 

.1354 

.1334 

.1315 

1 5 

1295 

.1276 

.1257 

1238 

1219 

1200 

.1182 

.1163 

.1145 

.1127 

1 6 

1109 

.1092 

.1074 

.1057 

.1040 

1023 

.1006 

.0989 

.0973 

0957 

1.7 

.09-10 

0925 

0909 

.0893 

.0878 

0863 

0848 

.0833 

.0818 

0804 

I 8 

0790 

0775 

.0761 

0718 

.0734 

0721 

0707 

.0691 

0681 

.0669 

1 9 

0656 

0614 

.0632 

0620 

0608 

0596 

.0584 

.0573 

0562 

0551 

2.0 

0510 

0529 

.0519 

.0508 

.0498 

0188 

.0478 

.0468 

.0459 

.0449 

2.1 

.0410 

0131 

0122 

0413 

.0101 

0396 

0387 

0379 

.0371 

0363 

2 2 

0355 

0347 

.0339 

.0332 

.0325 

.0317 

0310 

.0303 

0297 

.0290 

2 3 

0283 

.0277 

0270 

0264 

.0258 

.0252 

.0216 

.0241 

.0235 

.0229 

2 4 

.0224 

0219 

.0213 

0208 

.0203 

0198 

.0194 

.0189 

.0184 

0180 

2.5 

.0175 

.0171 

.0167 

0163 

0158 

.0154 

.0151 

.0147 

.0143 

,0139 

2 6 

0136 

.0132 

.0129 

.0126 

.0122 

.0119 

.0116 

0113 

.0110 

0107 

2 7 

0104 

.0101 

0099 

0096 

0093 

.0091 

.0088 

.0086 

0084 

0081 

2 8 

0079 

0077 

.0075 

0073 

.0071 

0069 

0067 

.0065 

.0063 

.0061 

2 9 

0060 

.0058 

0056 

.0055 

0053 

.0051 

.0050 

.0048 

.0047 

0046 





First drain . 

d place 

in t 





0 0 

0 1 

0.2 

0 3 

0 1 

0 5 

0 6 

0 7 

0 8 

0 9 

3 

0044 

0033 

0024 

0017 

.0012 

0009 

0006 

.0004 

0003 

.0002 

4 

.0001 

.0001 

0001 

0000 

.0000 

0000 

0000 

0000 

.0000 

.0000 


may seem necessary for plotting the normal curve. The final graph will 
have the same general appearance as figure 8.1. 

8.8 — Table of the normal frequency distribution. Table 8.6 differs 
from the usual frequency distribution in one major and several minor 
respects. The chief difference is that the frequencies (percentages or 
areas) are cumulative. To illustrate, look in the second column, opposite 
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/ = 1.0 in the first, where you find 3413. The meaning is that 34.13% of 
the total frequency of 10,000 falls on values of * lying between zero and 
1 . 00 . This corresponds to the fact that 34.13% of the area of the curve in 
figure 8.1 lies between the ordinates at x = 0 and x = 1. The figure 
34 . 13 % is approximately half of the familiar 68.27%. Again, opposite 2.0 
you find 4772, meaning that 47.72% of the frequency lies between x = 0 

TABLE 8.6 

Cumulative Normal Frequency Distribution 
t is standard measure, n — 10,000 



/ 

.00 

01 

.02 

03 

04 

05 

06 

07 

08 

09 


.0 

0000 

0040 

0080 

0120 

0160 

0199 

0239 

0279 

0319 

0359 


1 

0398 

0438 

0478 

0517 

0557 

0596 

0636 

0675 

0714 

0753 


2 

0793 

0832 

0871 

0910 

0948 

0987 

1026 

1064 

1103 

1141 


.3 

1179 

1217 

1255 

1293 

1331 

1368 

1406 

1443 

1480 

1517 


A 

1554 

1591 

1628 

1664 

1700 

1736 

1772 

1808 

1844 

1879 


5 

1915 

1950 

1985 

2019 

2054 

2088 

2123 

2157 

2190 

2224 


6 

2257 

2291 

2324 

2357 

2389 

2422 

2454 

2486 

2517 

2549 


7 

2580 

2611 

2642 

2673 

2704 

2734 

2764 

2794 

2823 

2852 


.8 

2881 

2910 

2939 

2967 

2995 

3023 

3051 

3078 

3106 

3133 


.9 

3159 

3186 

3212 

3238 

3264 

3289 

3315 

3340 

3365 

3389 

1 

0 

3413 

3438 

3461 

3485 

3508 

3531 

3554 

3577 

3599 

3621 

1 

1 

3643 

3665 

3686 

3708 

3729 

3749 

3770 

3790 

3810 

3830 

1 

2 

3849 

3869 

3888 

3907 

3925 

3944 

3962 

3980 

3997 

4015 

1 

3 

4032 

4049 

4066 

4082 

4099 

4115 

4131 

4147 

4162 

4177 

1 

4 

4192 

4207 

4222 

4236 

4251 

4265 

4279 

4292 

4306 

4319 

1 

5 

4332 

4345 

4357 

4370 

4382 

4394 

4406 

4418 

4429 

4441 

1 

6 

4452 

4463 

4474 

4484 

4495 

4505 

4515 

4525 

4535 

4545 

1 

7 

4554 

4564 

4573 

4582 

4591 

4599 

4608 

4616 

4625 

4633 

1 

8 

4641 

4649 

4656 

4664 

4671 

4678 

4686 

4693 

4699 

4706 

1 

9 

4713 

4719 

4726 

4732 

4738 

4744 

4750 

4756 

4761 

4767 

2.0 

4772 

4778 

4783 

4788 

4793 

4798 

4803 

4808 

4812 

4817 

2 

1 

4821 

4826 

4830 

4834 

4838 

4842 

4846 

4850 

4854 

4857 

2 

.2 

4861 

4864 

4868 

4871 

4875 

4878 

4881 

4884 

4887 

4890 

2 

3 

4893 

4896 

4898 

4901 

4904 

4906 

4909 

4911 

4913 

4916 

2 

4 

4918 

4920 

4922 

4925 

4927 

4929 

4931 

4932 

4934 

4936 

2 

5 

4938 

4940 

4941 

4943 

4945 

4946 

4948 

4949 

4951 

4952 

2 

6 

4953 

4955 

4956 

4957 

4959 

4960 

4961 

4962 

4963 

4964 

2 

7 

4965 

4966 

4967 

4968 

4969 

4970 

4971 

4972 

4973 

4974 

2 

8 

4974 

4975 

4976 

4977 

4977 

4978 

4979 

4979 

4980 

4981 

2 

9 

4981 

4982 

4982 

4983 

4984 

4984 

4985 

4985 

4986 

4986 

3 

0 

4987 

4987 

4987 

4988 

4988 

4989 

4989 

4989 

4990 

4990 

3 

1 

4990 

4991 

4991 

4991 

4992 

4992 

4992 

4992 

4993 

4993 

3 2 

4993 

4993 

4994 

4994 

4994 

4994 

4994 

4995 

4995 

4995 

3 

3 

4995 

4995 

4995 

4996 

4996 

4996 

4996 

4996 

4996 

4997 

3 4 

4997 

4997 

4997 

4997 

4997 

4997 

4997 

4997 

4997 

4998 

3 

6 

4998 

4998 

4999 

4999 

4999 

4999 

4999 

4999 

4999 

4999 


3 9 5000 
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and x = 2. Doubling this percentage, you have 95.44% of the frequency 
within the interval x = =±=2. The remaining 4.56% of the frequency lies 
beyond these limits. In general, the numbers in this table correspond 
to areas in figure 8.1 between ordinates at 0 and x. 

Some minor differences between table 8.6 and an ordinary frequency 
distribution are: (i) Since t is standard measure, its values are measured 
in terms of the population standard deviation, cr, as a unit. (The Greek 
lower case sigma, <7, serves to distinguish this parameter from the statistic, 
s.) Thus, t = 2 means x = 2a or x/a = 2; (ii) the values of t are not mid- 
points of class intervals, but are distances to their outer boundaries; 
(iii) only the positive values of t are listed, corresponding to the right half 
of figure 8.1; (iv) values of t are given to two decimal places, the second 
being read at the head of the column. To illustrate this last point, con- 
sider the number 4535 in the column headed 0.08 opposite 1.6 in column 
one. The interpretation is that 45.35% of the frequency lies between 
t = 0 and t = 1.68. 

Let us determine the value of t such that 5% of the frequency will lie 
beyond the limits, —t and +/. This implies that 2.5% will lie to the 
right uj t, with 47.50% between 0 and t. From the table, the value of t 
corresponding to 47.50% is 1 .96. This, then, is the 5% level in the normal 
frequency distribution. In working with large samples, the approximate 
value a: — 2 is often referred to as the 5% level, though the more precise 
value, 1.96, is sometimes met. In table 3 8 you will find this figure 1.96 
as the 5% value of t for infinite degrees of freedom. That is because t 
tends toward normal distribution in very large samples. 

In large samples where no distinction is made between degrees of 
freedom and number of observations, it is customary to test the significance 
of the difference between two means without pooling the sums of squares. 
The theorem of section 3.9 is used as before, but the variances of the two 
means are computed separately, then added. The formula is 

cither = Ji 2 / W i + ^ 2 /n 2 , 

or Si 2 = Sn 2 + s i2 z 

where s 1 2 and Sx 1 2 belong to the first sample, s 2 2 and jj 2 2 to the second. As an 
example, it is reported that for 841 13-year-old boys the mean height 
was Xi = 57.3 inches with si = 2.76 inches, while for 784 girls of the same 
age, x 2 ~ 58.6 and si = 2.44 inches. The difference of the means is 
therefore 1.3 inches with standard error, 

\/( 2.76)7841 + (2.44)7784 = 0.129 inch 

Since t = 1.3/0.129 = 10 is greater than 2, the superior height of the 
girls at age 13 may be accepted as a population characteristic. 

In the borders of table 8.6 containing values of standard measure, 
t = tf/a, you will note the range of this variable in a very large sample, 
from about —4 to 4. In smaller samples, the average range decreases as 
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indicated in table 5.5. In all normally distributed populations standard 
measures are directly comparable. That is, values of x/a between zero 
and 1.4, for example, occur with the same probability, 41.92%, no 
matter what the actual units of measurement may be. 

In example 1.25 we had x 2 — 12.2, a value far beyond the limits of 
the table. To evaluate the probability, citation was made to this section. 
Now, it is shown in mathematical statistics that the square root, x> of 
the chi-square in chapter 1 is distributed like t in table 8.6. Hence, 
X = V12.2 = 3.49 may be used as t to enter the present table. The 
probability read is 0.4998, corresponding to 0.0002 for values greater 
than t = 3.49. For the reason presented in the next paragraph, this 
probability is doubled, so that we have P = 0.0004 that a more divergent 
sample will occur in random drawing from the hypothetical population. 
This is the basis of the statement made in example 1.25. 

It is now clear that the class intervals for chi-square in table 1.4 may 
be taken from table 8.6. As an example, let us verify the fact that 10% 
of the total frequency lies in the interval from x 2 = 0.0158 to x 2 = 0.0642. 
The corresponding values for x — * are 0.1257 and 0.2543. Interpolating 
these values in the table, the probabilities 0.05 and 0.10 are taken out. 
Hence, the difference, 0.10 — 0 05 = 0.05, or 5% of the total frequency 
occurs in this interval. But, since x 2 is always positive, the corresponding x 
distribution is to be taken over only half of the total frequency. Hence 5% 
of the whole curve constitutes 10% of the right half. Other illustrations of 
the use of the table will be given in the examples to follow. 


EXAMPLE 8.15 — Using table 8 6, show that 92.16% of the items in a normally 
distributed population lie between — 1.76a and 1 76 a. 

EXAMPLE 8.16 — Show that 65.24% of the items in a normal population he 
between x = — 1.1a- and x — 0.8a, 

EXAMPLE 8 17 — Show that 13.59% of the items are between x = a and x = 2a. 

EXAMPLE 8.18 — Using table 8 6 show that approximately 50%) of the items lie 
in the interval from —0.6745a to 0 6745a. Compare this with section 2.16. 

EXAMPLE 8 19 — Show that the 1% level is at x/a = 2.575. You will find a 
slightly more precise value in table 3 8 at n — » . 

EXAMPLE 8.20 — Verify the statement that the interval x =*= 1 645a contains 90% 
of a normal frequency. 

EXAMPLE 8.21— In table 8 4, compute the frequency expected in the classes; 
(i) from 42.5 to 43.5 mm. (Ans 45 2), (li) from 43 5 to 44 5 mm. (Ans. 49.2). Hint: 
Take £ — 44.11 and s = 2.626 mm., then for the class boundary 42.5, t =* (42.5 
- 44.1 1 )/2 626 « -0.61. 

EXAMPLE 8.22 — If you should continue the process suggested in the foregoing 
example, you would have an observed and expected frequency foi every class. Could 
you test the significance of the departure from normal by the use of chi-square? 

EXAMPLE 8.23 — In example 1.23 the value of x 2 was found to be 8.23. What is 
the probability of a larger value under the hypothesis of independence > Ans. 0 004. 

EXAMPLE 8.24 — The statement was made that of 1,000 13-year-old boys, 390 
had heights within 1.4 inches of the mean height, 57.3 inches. Show that r = 2.75 
inches on the assumption of normal distribution. 
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EXAMPLE 8.25 — The mean of 1,000 normally distributed plot yields of corn was 
20.50 pounds per plot. It was observed that 200 of the plots yielded 17.75 pounds or 
less. Estimate the standard deviation of the yields. Ans. 3.27 pounds per plot. 

EXAMPLE 8.26 — The mean and standard error of the dressing percentages in 
one large group of swine were 81 7 0.46%, in another, 80.0 =*= 0 37% Show that 

for testing the mean difference t = 2.88, a highly significant value 

EXAMPLE 8.27 — The equation of the normal curve is 

n -*/2 0* 


From this, those with some mathematical training can trace the curve, showing that a 
point of inflexion occurs at x = =*■ cr . Table 8.6 contains values of the integral, 



- *72 

dt 


in which t = x/a. Can you integrate this function* 


8.9 — The calculation of the correlation coefficient in a two-way 
frequency distribution. The double entry tabic 8.7 contains in compact 
form the record of two measurements on each of 327 ears of corn (5). 
The diameters, X, were grouped into millimeter classes as in table 8.4, 
the frequencies and code numbers appearing in the bottom rows of the 
table. The weight classes, Y, are 10 grams each with frequency and code 
at the right. Correlation in these data is evidenced by the tendency of 
high frequencies to lie along the diagonal of the table, leaving two corners 
blank — there are no very heavy ears with small diameters. In each di- 
ameter array (column) there is notable variation in weight, but it is not 
nearly so great as the entire range displayed in the lefthand column. The 
regression of weight on diameter could be represented by a line following 
the upward trend of the data. This line would be fitted by the method 
of least squares to the means of the successive diameter arrays. 

The calculations follow patterns already familiar. Those leading to 
Sx 2 are taken from that part of table 8.4 which is the counterpart of table 
8.1. The corresponding calculations for weight are carried through with 
the data in columns f v and Y. The results are summarized at the bottom of 
the table. 

The device for arriving at SXY, displayed in the last two columns of 
the table, has not been explained before. Each number in column SXj x 
is calculated from the frequencies in that row together with the code 
numbers in the bottom line. As examples: 

(i) In the 3rd line: (3) (4) = 12 

(ii) In the 4th line: (1)(2) + (1)(4) = 6 

(hi) In the 5th line: (1)(1) + (2) (2) + (1)(3) + (4)(6) = 32 

(iv) In the 7th line: (1 ) ( — 2) + (3) ( — 1 ) + (7) (1 ) + (3) (3) + (3) (4) = 23 

The products in the last column are got from the two columns preceding. 
The sum at the end of this column is the desired SXY. You remember 



TABLE 8.7 

Computation of Correlation Coefficient in Two-way Frequency Table 

Frequency of Occurrence of Ears of Maize having each Diameter and Weight 
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TABLE 8.7— (Continued) 


SXf x - 37 

S.Vf x - 2,279 
{SXf x Y/n = 4 19 


AV = 2,274 81 


SYJ U = —46 AA'F - 2,318 

sf*/ v = 7,264 (&iy,)(.sr/ 1 ,)/» - -5 20 

(. SYf y ) 2 /n = 6 47 - 

2,323 20 

A> 2 = 7,257 53 


r 


A*y 


2,323 20 


V (&*) (A> 2 ) v ^2,274^81 )(7, 257 53) 


= 0.5718 


that no decoding is necessary for r. Incidentally, the sum of the column, 
SXJ T , checks si in tabic 8.4. 

EXAMPLE 8 28 — Using the data m columns f u and F, table 8 7, calculate A’y 2 
- 7,257 53, together with the mean and standard error, 198 6 =*= 2 61. 

EXAMPLE 8.29 - Calculate the mean, 44.1, and standard deviation, 2.64, in the 
42-millimcter array of diameters, tabic 8 7. 

EXAMPLE 8.30 — In the 200-gram array of weights, compute x = 198 6 and 
s « 47 18. 

EXAMPLE 8 31 - -Compute the regression corfl.cient of weight on diameter, 
1 0213, together with the regression equation, Y — 1 0213A' + 1 54 81. 

EXAMPLE 8 32 -Calculate the mean \\< lght of the ears in each of the 16 diameter 
arras s of table 8 7 Picsent these means graphic ally as ordinates with the corresponding 
diametf is as abscissas Plot the graph of the regression equation on the same figure Do 
>ou get a good fit- 3 Is there any evidence ol curvilmearity in the regression of means- 1 

EXAMPLE 8.33 — Calculate the mean diameter in each of the 28 weight arravs. 
Plot these means against the weight class marks Docs there seem to bo any pronounced 
on vj linearity in the regression of these mean diameters on the weight? Can you write 
the regression equation giving estimated diametei for ear h weight- 1 

EXAMPLE 8 34 — If you are interested in mathematics, you will naturally wish 
to derive the formula for SXY in table 8 7. This table, m fact, introduces many fasci- 
nating problems. You will find an excellent discussion, with bibliography, in (8). 


8.10— Some theorems about the correlation coefficient in large 
samples. With increasing sample size the distribution of r tends toward 
the normal. Furthermore, there ceases to be any notable numerical 
differentiation between the use of «, n — 1, or n — 2 in the calculations 
or tests of significance. The results are marked simplifications in many 
of the formulas. Sonic of them will be introduced here for the sake of the 
accompanying interpretations of r. In reference (7) many others are 
available to those who have some training in mathematics. 

The formula for variance of estimate, section 6.8, is readily trans- 
formed (compare section 7.8) to 

V * 2 = (1 - r 2 K 2 

The corresponding formula for the variance of estimated values of Y is 

jjj 2 = r 2 s^ 



186 


Statistical Methods 


Section 8.10 


These formulas emphasize the intimate relation between the correlation 
coefficient and the concepts of regression. They may be written 

r 2 = s$ 2 / Sy and 1 — r 2 = s v . x 2 /s v 2 

Thus, r 2 measures the fraction of the variance of F which is due to regres- 
sion on X, while 1 — r 2 measures the remaining fraction independent of X. 
A small value of r 2 goes hand in hand with a small variation in estimated 
F, and with a large variation in errors of estimate. Again, if r 2 is large, 
regression itself accounts for most of the changes in F, the errors of estimate 
being small. Still another explanation can be advanced if you recall the 
fact, clearly evident in the formula of section 7.4, that r is a measure of 
the correspondence of variations in two sets of deviations from mean. If 
these variations keep closely in step then the covariation is large, and the 
variation of X may be said to account for most of the variation of F. In 
that event, the residual, unaccounted-for variation, s v . x 2 is small relative 

tO Sy \ 

If desirable, the fraction, r 2 = s„ 2 / r„ 2 may be written as a percentage. 
For example, if r = 0.60, then r 2 = 0.36 and it may be said that 36% 
of the variance of F is attributable to regression, leaving 64% unaccounted 
for. 

One must discipline his mental processes if he is to think clearly in 
terms of variance. The effort is worthwhile. Why not use standard 
deviation instead? One would expect it to be simpler. Try reducing some 
of the foregoing statements to corresponding ones about standard devia- 
tion. For example, since r = s$/s V9 the correlation coefficient itself may be 
thought of as the percentage of the standard deviation which is due to 
regression. Then r = 0.60 means that 60% of s v is accounted for by the 
regression of F on X. That is easy. But here is the hitch: \/ 1 — (0.60) 2 
or 80% is unaccounted for. The two percentages of standard deviation do 
not add to 100%, whereas those of variance do. It is more convenient, 
therefore, to talk about variance; but it is of the utmost importance not to 
think “standard deviation” when you say “variance.” Standard deviation 
is an objective concept, being measured in familiar units like pounds or 
inches. Variance is a mathematical concept whose mastery requires 
thought and experience. 
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Chapter 9 


ENUMERATION DATA WITH MORE THAN 
ONE DEGREE OF FREEDOM 


9.1 — Introduction. The chief purpose of chapter 1 was the easy 
introduction of sample-based inferences about populations. Unbiased 
estimates, confidence statements and tests of null hypotheses were ex- 
plained with as little diversion as seemed feasible. Those primarily inter- 
ested in enumeration statistics were referred to this chapter for further 
exposition. 

In the intervening chapters a good deal has been said about the normal 
distribution: it is the basis for many of the statistical methods devised 
for measurement data. The methods of the present chapter, including 
those of chapter 1, are appropriate for samples from two distributions 
known as the binomial and the Poisson. In chapter 16, these distributions 
will be described in somewhat the same fashion as the normal was in 
chapter 8. We are now ready to extend the methods of chapter 1 in two 
ways: to experiments involving more than 1 sample and to samples w r ith 
more than 2 classes of individuals. 


9.2 — An experiment with more than 1 sample. It is often both 
convenient and informative to collect data in several small samples 
rather than in a single large one. An example is furnished by some experi- 
ments on chlorophyll inheritance in maize (13) reported in table 9.1 . This 
series consisted of 1 1 progenies of heterozygous green plants, self fertilized, 
segregating into dominant green and recessive yellow, presumably in the 
ratio of 3 to 1 . 

As an illustration, consider the first progeny with 122 plants, 98 green 
and 24 yellow. Under the 3T hypothesis the numbers expected are 
three-fourths of 122 = 91.5 green, and one-fourth, 30.5, yellow; then chi- 
square (review section 1 8 if necessary) is, 


(98 - 91 .5) 2 (24 - 30. 5) 2 

91~5 ' 305 


1 85, 


not an excessive value in random sampling from homogeneous material. 
Omission of the adjustment for continuity will be explained in sec- 
tion 9.4. 

The remaining chi-squares are similarly calculated and recorded in 
the table. Since most of them are small and all nonsignificant, there is 
no evidence that the 1 1 progeny samples may not have been drawn from 
a 3:1 population In fact, somewhat more variation might have been 

[ 188 ] 
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TABLE 9.1 

Numbers of Green and Yellow Seedlings in 11 Progenies of Self-Fertilized, 
Heterozygous Green Plants, With Numbers Expected Under the 3:1 Hypothesis, 
and Values of Chi-Square 


Number of 
Plants 

Green 

Yellow 

Expected 

Green 

Expected 

Yellow 

Chi-square 

122 

98 

24 

91 50 

30 50 

1 85 

149 

110 

39 

111 75 

37 25 

0 11 

86 

68 

18 

64 50 

21 50 

0 76 

55 

42 

13 

41 25 

13 75 

0 05 

71 

54 

17 

53 25 

17 75 

0 04 

179 

141 

38 

134 25 

44 75 

1 36 

150 

120 

30 

112 50 

37 50 

2 00 

36 

27 

9 

27 00 

9.00 

0 00 

91 

70 

21 

68 25 

22 75 

0 17 

53 

39 

14 

39 75 

13 25 

0.05 

111 

85 

26 

83 25 

27 75 

0 15 

Sum of 11 chi-squares 




6 54 

Total 1,103 

854 

249 

827 25 

275 25 

3 46 


expected: you remember that about 1 random sample among 20 has 
chi-square greater than 3.841. 

A valuable property of chi-square is that sums of sample values are, 
themselves, distributed in the characteristic chi-square fashion. But such 
sums have distributions different from that which became familiar in chap- 
ter 1. The earlier distribution of chi-square was for a single degree of 
freedom, while the one for the sum of v chi-squares, each with d.f. = 1, 
has n degrees of freedom. The sum, x 1 = 6.54, in table 9.1 has d.f. = 11, 
and the probability of a larger may be determined by looking along line 1 1 
in table 9.2. It lies about 7/10 of the interval from 0.9 to 0.8: that is, at 
approximately P = 0 83. Clearly, the deviations from 3:1 in these 11 
progenies are less than would ordinarily be encountered in sampling 
from a single binomial population 

The advantage of the sum of chi-squares is that information in the 
smaller samples is accumulated in the manner of large samples. To 
illustrate: the sum of 10 chi-squares might be 27.06, 10 times the 10% 
point for 1 degree of freedom; yet for 10 df. this sum is beyond the 1% 
point. No one of the small samples might have tested significant, but some 
uniform tendency toward the larger chi-squares makes the sum quite 
convincing. On the other hand, if the individual chi-squares tend to be 
small, the accumulated information makes for stronger evidence of ho- 
mogeneity. You will see this by supposing that the sum of 10 chi-square 
is 10 times the 50% point for 1 degree of freedom. Try also, the 30% 
point. 

Returning to the corn seedlings, one may well consider pooling the 
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From table III of referenced) with permission of Professor Fisher and of his publishers, Oliver and Boyd, Edinburgh. 
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11 progenies into one sample of 1,103 plants because there is so far no 
evidence against the 3:1 hypothesis. This is done in the last line of the 
table. The large value, x 2 = 3.46, d.f. = 1, P = .07, attracts attention to 
an excess of green plants in all but 3 of the progenies, an excess that 
accumulates to 854 — 827.25 = 26.75 plants. While chi-square is less 
than the 5% point, it does suggest something to look out for in other 
experiments: even though no one of the subsample chi-squares were 
large, yet a persistent bias might lead to a significant value for the merged 
data. Thus the combined sample might bring to light a slight deviation 
from hypothetical ratio, undetected in the smaller portions (see example 
9.1). 

There is a third feature of tables like 9.1 about which evidence is 
available — the consistency or inconsistency of the progeny ratios. This is 
indicated by a third chi-square, the difference between the first two: 



Degrees of Freedom 

Chi-square 

Total 

11 

6 54 

Pooled 

1 

3.46 

Interaction 

10 

3 08 


This interaction or heterogeneity chi-square (8) (9) is distributed with degrees 
of freedom equal to the difference between those for the total and pooled 
chi-squares. In other experiments this might be informative for this 
reason; since deviations are squared, and since the progeny chi-squares 
therefore fail to distinguish between the positive and the negative, the 
interaction furnishes a test of discrepancies among them. 

To illustrate the presence of inter action, the ratios in table 9.3 were 
selected with alternate positive and negative deviations. The four sample 
chi-squares arc each large though nonsignificant. The total with 4 degrees 
of freedom is close to the 2% point, indicating over-all deviation from the 


TABLE 93 

Ratios Selected to Illustrate Interaction in Samples From a 3:1 Population 



Number Designated 

Deviation From 3.1 


Size of 






A 

B 

A 

B 

Chi-square 

Subsample 

116 

78 

38 

-9 

9 

3 724 

92 

76 

10 

7 

-7 

2 841 

152 

106 

46 

-8 

8 

2 246 

124 

101 

23 

8 

-8 

2.753 

484 

1 361 

123 



11 56 


Total 

4 

11 56 




Pooled 

1 

0 04 




Interaction 3 11 52, P~ 0 01 (approximately) 
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3:1 ratio. But the pooled data segregate almost 3:1. Comparison of the 
2 probabilities suggests heterogeneity among the progeny ratios, further 
evidence being supplied by P = 0.01 for the interaction chi-square. In 
such an experiment the pooled data have little meaning — apparently 
none of the progenies represent 3:1 populations. It may be well to empha- 
size this point: the pooled data with their corresponding chi-square 
contain evidence about the hypothetical ratio only if the several sample 
ratios are homogeneous as evidenced by a small heterogeneity chi-square. 

EXAMPLE 9.1 — Compute the several chi-squares appropriate to the ratios 
98:41, 71.31, 127.52, 61 25, 86'36, the theoretical ratio being 3.1. The total chi-square 
is 6.72, with 5 degrees of freedom. For the pooled data, x 2 = 6 65, df. — 1. This 
example illustrates the situation in which there is a persistent excess of recessive plants, 
strong enough to boost the chi-square of the pooled data to significance. Consistency 
in the progeny ratios is shown by the difference, x 2 ~ 6.72 — 6.65 = 0.07, df. =* 4. 

EXAMPLE 9.2 — Here is another group of ratios for practice: 141-38, 83 28, 
98-24, 68 18, 67 24, 77:39. Chi-squares are 1 36, 0 00, 1 85, 0 76, 0 09, and 4 60, 
only the last indicates significant depai tines from the 3 1 population ratio The sum, 
however, x 2 — 8 66, df. — 6, is a moderate value (P — 0.20). Unless some defect 
could be discovered in the technique of handling the last group, its large chi-squarc 
would be attributed to the vagaries of sampling In the pooled data variation is not 
significantly great since x 2 = 0.21, df. = 1. Interaction chi-square, 8 66 — 0.21 
*= 8.45 with 5 d /., is not strong evidence against the hypothesis of sampling from a 
common 3:1 population. 


9.3 — An experiment with more than 2 classes of individuals. The 
9:3:3: 1 hypothesis. With liberation from the restriction of a single 
degree of freedom, all the theoretical genetic ratios may be tested, both 
in single samples and in combinations. As examples, two groups of pioge- 
nies are recorded in table 9.4, each being F 2 plants from a cross between 
japonica colored maize plants and fine-striped. The seedlings were ex- 
pected to segregate in 9:3:3T ratios. Hence, from the 168 plants in the 
first group, one would expect nine-sixteenths green, three-sixteenths 
japonica, etc. The formula for chi-square, section 1.8, specifies the sum 
of the four component parts: 

(117 - 94.5) 2 /94.5 - 5 36 

( 26 — 31 5) 2 /31 5 = 0 96 

( 18 — 31 5) 2 /31 5 = 5.79 

( 7 — 10 5) 2 / 10 5 = 1 17 

x 2 = 13.28 

This value of x 2 is based on three degrees of freedom, one less than the 
number of subsamples contributing. It is larger than the 1% point, 1 1.34. 
There is little chance, therefore, that the seedlings in this group arc a 
sample from a population segregating 9:3:3: 1 . That is, the sample departs 
significantly from the expected ratios. 

In the second group of progenies, x 2 = 9.82, d.J, = 3, is just at the 2% 
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TABLE 9.4 

F t Seedlings From a Cross of Japonica With Fine-Striped Maize 
Lindstrom’s data (13) 

Expected ratios, 9:3. 3:1. Expected numbers in parenthesis 


Progeny 

Number of 



Fine- 

Combi- 

Chi- 

Numbers 

Seedlings 

Green 

Japonica 

Striped 

nation 

square 

1,4,5 

168 

117(94 5) 

26(31 5) 

18(31 5) 

7(10 5) 

13 28 

2, 3 

135 

82(76 0) 

12(25 3) 

33(25.3) 

8( 8.4) 

9.82 

Total 

303 

| 199 

38 

51 

15 



point in table 9.2. The departure from expected is significant but less 
pronounced than in the first group. 

It is easy to see that the total chi-square, 13.28 + 9.82 = 23.10, 
d.f. = 6, is highly significant. One may be interested in this fact, also* 
the segregation of the entire 303 seedlings yields x 2 — 12.37, d.f. — 3, a 
highly significant departure from expected. Finally, the interaction has 
X 2 = 23.10 - 12.37 = 10.73, df. = 6 - 3 = 3, P = 0.02. The evidence 
is the same no matter what particular question is asked; the plants in the 
sampled populations do not segregate in 9:3:3 :1 ratios according to color. 

EXAMPLE 9.3 — In one of Bateson’s experiments (2) on the phenomenon now 
known as linkage, sweet peas with blue flowers (B) were crossed with others having 
round pollen grains (/), also, plants with red flowers (b) with those having long pollen 
grains (L) If these characters were assorted independently, the second generation 
plants from the cross Bl X bL would have segregated into the four classes- BL y Bl , 
bL, and bl with ratios 9*3 3 1. Actually, the observed numbers were 226 95.97.1. 
Compute x 2 = 32 36, dj. — 3 The hypothesis of independent assortment is clearly 
refuted. Students of genetics will lealize that linkage is the cause of the upset theory. 

9.4 — The adjustment of chi-square for small expected numbers. 

The reader will perhaps wonder why the adjustment for continuity has 
not been applied to the foregoing chi-squares. For all values based on a 
single degree of freedom it is appropriate and should be used if precision 
is desired in the evaluation of probability, especially in tables having 
small expected numbers — 5 or less. The reasons for the omission of the 
adjustment in the foregoing sections arc four: (i) no great accuracy in P 
has been required, (ii) the addition theorems do not apply to adjusted 
values, (iii) the adjustment described in section 1 13 is not appropri- 
ate for 2 or more degrees of freedom, and (iv) none of the expected fre- 
quencies dealt with have been smaller than 5. 

For unbiased evaluation of probability in tables having a single degree 
of freedom, adjust for continuity in the manner already explained; but 
if you wish to use an addition theorem, apply it to unadjusted chi-squares. 

Yates’ adjustment is intended to correct a bias; that is, a tendency 
toward an estimate different from the true value. In particular samples 
the adjusted value may not be so accurate as the unadjusted. Those who 
wish to test this for individual experiments with a single degree of freedom 
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will find an exact method explained in (21), (8), and (10). Different 
conclusions may occasionally be drawn if there are few observations in 
some cells of the table. 

9.5 — Testing chi-square if degrees of freedom exceed 30. If the 

degrees of freedom for chi-square exceed 30, the distribution is not far 
from normal. Fisher has suggested (8) that, as a good approximation, the 
distribution of table 8.6 be used with 

t = VV- VWM 

As an example suppose x 2 = 60 with df . = 50. Then 

t = V2(60) - V2(50) - 1 = 1.005 

The probability of exceeding this value in one-half of the normal curve is 
0.5000 — 0.3425 = 0.1575. For reasons explained in section 8.8, the 
customary value of P is double this, or 0.315: clearly, x 2 *= 60 is not 
unusual in random sampling from the chi-square distribution for 50 d.f. 

9.6 — A test of independence. After a sample has been divided on 
the basis of one attribute the individuals in each part are often subdivided 
according to a second. One question that immediately occurs is, “Are the 
two attributes related or are they independent?” If there are only two 
classes of each attribute, the 4 subsamples are conveniently presented in 
a 2 X 2 or fourfold table like 9.5, A. So far as this sample of farms is 
concerned (review sections 1.2 and 1.3; also example 1.24) the attributes, 
tenure status and need for labor, are related because 

100(290)/480 = 60.4% 


TABLE 9.5 

Subdivision of the South Dakota Sample Betwfen Owners and Renters 
A. Observed Frequencies 


First Attribute* 

Ability to Utilize 

Second Attribute* 

Tenure Status 


Owner 

Renter 

Total 

Labor to Increase Pigs 

Can 

290 

280 

570 

Cannot 

190 

240 

430 

Total 

480 

520 

1,000 


B. Probabilities Calculated From Border Totals 


Owner 


Renter 


Can (0 57) (0 48) « 0 2736 (0 57) (0 52) « 0 2964 

Cannot (0 43) (0 48) = 0 2064 (0 43) (0 52) - 0 2236 
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of the owners were able to utilize extra labor, whereas only 53.8% of 
the renters could. We wish to get evidence on whether the difference 
between these two percentages may be attributed to sampling variation or 
whether it reflects a population difference. It is instructive to note that 
the sample relation between the attributes is equally evident from the 
percentage of “cans” who are owners 

(100)(290)/570 = 50.9% 

and of “cannots,” 44.2%. Any way one looks at the table it is clear that, 
in the sample, ownership and ability to use extra labor are associated. 
Apparently these owners were better supplied with the necessary facilities 
for raising pigs (fencing, houses, feed, etc.), and could increase this enter- 
prise were labor available. 

To get evidence about the population, we set up the hypothesis that 
in it the two attributes are independent. The meaning of this term is clarified 
by reference to the probabilities in part B of table 9.5. Since the prob- 
ability that a farmer can use extra labor is 570/1,000 = 0.57, this number 
is entered in both columns of line 1. In like manner, the probability 
that he cannot, 430/1,000 = 0.43, is put in the second line. In both lines 
of column 1 is entered the probability, 480/1,000 = 0.48, that a farmer 
is an owner, while in both lines of column 2 is put down the probability, 
0.52, that he is a renter. Now, the hypothesis is that in the population 
these probabilities are independent. If they are, then according to 
probability theory the chance that a record be assigned to any cell of the 
table is the product of the two probabilities in that cell. Thus the prob- 
ability of being both a “can” and an owner is (0.57) (0.48) = 0.2736 
provided the two probabilities are independent. Notice the sum, 

0.2736 + 0.2064 + 0.2964 + 0.2236 = 1, 

indicating certainty that a record will fall in some one of the four sub- 
sarnples. 

The product probabilities lead immediately to the set of expected 
numbers in table 9.6; that is, numbers expected if the attributes are inde- 
pendent. Simply multiply each by the sample size, 1,000. These expected 
numbers, along with the four deviations, make up a chi-square index of 
dispersion having the same distribution as the one studied in chapter 1. 
The calculations are completed in table 9.6. For chi-square there are now 
four terms in the summation instead of two. Observe carefully that the 
expected numbers in each row and column add to the border totals and 
that the deviations add to zero. Since the squares of the deviations are all 
equal, the formula for chi-square takes the special form, 

X 2 = (X - rri) 2 S(\/ni), 

or in words, chi-square is the square of the deviation multiplied by the 
sum of the reciprocals of the expected numbers. 
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TABLE 9.6 

Calculation of Chi-Square in a 2 X 2 Table Under 
the Hypothesis of Independence 
The South Dakota Farm Sample of 1,000 


Ability to 

Utilize Labor 

Tenure Status 


Owner 

Renter 

Total 

Can 

Observed 

Expected 

290 

273 6 

280 

296 4 

570 


Deviation 

16 4 

-16 4 


Cannot 

Observed 

Expected 

190 

206 4 

240 

223.6 

430 


Deviation 

-16 4 

16 4 


Total 

: 

480 

520 

1,000 


Expected numbers: (570)(480)/l ,000 = 273 6* 

(570) (520)/!, 000 = 296 4 
(430)(480)/1,000 = 206 4 
(430)(520)/l ,000 = 223 6 

Total 1,000 0 


Or, 


X 2 ** S{X — m) 2 /m 

_ (164)’ (-1 6 4) 3 (-16 (16 4)* 

273 6 + 206 4 + 206 4' + "223 6 


= 0 983 + 0.907 4 1.303 + 1.203 - 4.40 


X 1 = (16 4)«- 


L + L. + JL + »_ 

273.6 T 296 4 T 206.4 T 223 6 


= (268.96)|o.016346j = 4 40 


* This compulation is somewhat shorter than that in tabic 9 5, B. 


Comparison of the calculated chi-square, 4 40, df =1, with table 9 2 
shows that only about 4% of random samples from the hypothetical 
population have larger values. The hypothesis of independence is suspect. 
It seems reasonable to reject it, concluding that owners in the population 
are better able than renters to use extra labor. 

Rejection of the hypothesis of independence leads to the conclusion 
that the sample of 1,000 is really drawn from a mixture of two different 
populations: a population ol owners in which 60.4% is an unbiased 
estimate of the fraction of those who can use help, and a population of 
renters in which the estimate is 53.8%. Alternatively it may be said that 
the two populations are these: (i) farmers who can use extra labor, among 
whom an unbiased estimate of the proportion of owners is 50.9%; and 
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(ii) farmers who cannot, with the estimate, 44.2%. For each population 
confidence limits may be set; also, if the necessary information is at hand, 
all estimates and limits may be expanded to population totals. 

In the formula of table 9.6 the adjustment of chi-square is easily made 
and should be used as routine procedure even though it would modify 
decisions only in occasional samples of small size. The deviation from 
expected, X — m, is decreased in absolute value by 0.5, the formula being, 

X 2 = (|A'-m|-0.5) 2 S(l/m) 

Applying this to the data of table 9.6, adjusted chi-square is 
(15.9) 2 (0.016346) = 4.13 

It may be well to emphasize the contrasts between this section and 
those preceding. Here is set up no hypothesis, such as 3:1, about a 
numerical relation between the frequencies of the attributes. The only 
postulate is that the incidence of the two sets of attributes is uncorrelated. 
In the 2-way table, the border totals are made the basis of probabilities 
that individuals will foil into the several subclasses. It follows that only 
1 of the 4 cell frequencies can be filled in at will — the other 3 are fixed by 
subtraction — so that chi-square is distributed with a single degree of 
freedom. Finally, since the border totals are used to determine the 
expected numbers in the cells of the table, there is no counterpart of the 
total and pooled chi-squares calculated in earlier sections: only the inter- 
action is now in question. The hypothesis of independence is identical 
with the hypothesis that there is no interaction in the population. 

The test of independence is equally a test of the hypothesis that the 
two samples — those in the 2 rows or those in the 2 columns — are randomly 
drawn from a common binomial population. This is often referred to, 
also, as a test of homogeneity. 


EXAMPLE 9.4 - In one of the experiments reported by Lindstrom 94 plants 
segregated in the ratio, 50 green to 44 yellow. Test the hypothesis that this sample and 
the 1,103 in table 9.1 are diawn from a common binomial population. Do not assume 
the 3*1 ratio, but make the test of independence or homogeneity Ans Adjusted chi- 
squat c is 26.22. 

EXAMPLE 9 5 - In the fall of 1943 approximately one in every 1,000 families of 
Iowa was asked about the canning of fruits or vegetables during the preceding season. 
Of the 370 rural families enumerated 357 had done canning, while among the 300 city 
(2,500 or more inhabitants) families 274 had canned Test the hypothesis that the 
probability of canning is the same in urban and rural communities. Ans. x 2 — 7.11. 

EXAMPLE 9 6 — Set 0 95 confidence limits for the percentage of eastern South 
Dakota owners who can utilize extra help. Ans. About 56% and 65% Compare 
this interval with that for the entnc sample 54-60 (section 1.3), got without the addi- 
tional information of table 9.5. 

EXAMPLE 9.7 — On the assumption that 25,000 farmers in eastern South Dakota 
are owners or part owners, expand the foregoing estimate and limits. Ans. An esti- 
mated 15,100 owners can utilize extra help with 95% confidence limits, 14,000 and 
16,200. 
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9.7 — An alternative formula for chi-square in a fourfold table. 

Compared to that of table 9.6, this formula is usually easier of computation, 
especially if a calculating machine is available and if the short-cut 
methods of multiplication and division are used. The numbers in the 
four cells of the table, and in the borders, may be symbolized thus, 

a b a T* b 

c d c + d 

a + c b + d n 

where n — a + b + c-\-d. The formula is then written, 

j (at/ - bc)*n 

X (a + b){c + d){a + c)(b + d) 

Let us apply this to the data in table 9.7, taken from a study of the 
migration of young people from rural homes in Iowa. 


TABLE 9.7 

Children Over 16 Years of Age Who Have Left Their Rural Homes. 
From Wakfley (20) 


Location 

Males 

Females 

Total 

Percent Males 

Open country 

607 

726 

1,333 

45 54 

Village 

231 

266 

497 

46.48 

Total 

838 

992 

1,830 

45.79 


, (726 X 231 - 607 X 266) 2 (1,830) 

X ~ (1,333) (497) (838) (992) 

The numbers involved will be cumbersome unless only the first four or 
five digits are retained throughout the calculations. The decimal is easily 
fixed by dividing both numerator and denominator by some convenient 
power of 10: that is, by moving all decimals to the left an equal number of 
places in numerator and denominator so that only small groups of digits 
are involved in the calculations. In this example, after having evaluated 
the binomial, 

726 X 231 - 607 X 266 - 6,244, 

the decimals may be moved a total of eleven places in each term of the 
fraction: 

2 « (0-6244)(0.6244)(1.830) = 0.7135 

X (0.838) (0.992) (1.333) (4. 97) 5.507 

This value of chi-square indicates that, in the population, the ratio 
of the sexes is probably independent of the location of the home. The 
distinction between open country and village may therefore be abandoned 
and the totals of males and females used to arrive at an unbiased estimate 
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of the percentage of males, 45.8%, with confidence limits 43.5% and 48.1% 
(see chapter 16 for computation of confidence intervals in large samples). 

In the formula above, the chi-square adjustment is made by sub- 
tracting half the total sample size from the absolute value of the binomial 
in the numerator before it is squared: 

2 (\ad — be | — n/2) 2 n 

X (a + b){c + d)(a + c){b + d) 

Use of the formula is illustrated by the records (table 9.8) of clinical 


TABLE 9.8 

Numbers of Takes and Failures of Inoculations With Vivax Malaria in the 
Months January to March Contrasted With April to December 



Result of Inoculation 


Months 

Take 

Failure 

Total 

January-March 

13 

6 

19 

April-Dcccmber 

142 

10 

152 

Total 

155 

16 

171 


attacks experienced by 155 white patients therapeutically inoculated by 
anophelines infected with the McCoy strain of Plasmodium vivax , employ- 
ing sporozoites not exceeding 50 days of age (3). 

o (142 X 6 - 13 X 10 - 85.5) 2 (171) _ 

(19)(152)(155)(16) 

This would indeed be an unusual sample from a homogeneous population 
Apparently, the success of inoculation with Vivax malaria is not inde- 
pendent of season. 

If adjustment is not made, x 2 = 12.45, a value with probability of 
being exceeded only one-fourth that of the adjusted (section 8.8). 


EXAMPLE 9.8 — Dr. C. H Richardson of the Iowa Agricultural Experiment 
Station has furnished the following numbers of aphids ( Aphis Rumicis L.) alive and dead 
after being sprayed with two solutions of sodium oleate: 


Concentration of 
Sodium Oleate 
(percent) 

i 

i 

Alive 

Dead 

Total 

Percent 

Dead 

1.10 

3 

62 

65 

95 4 

0 65 

13 

55 

68 

80 9 

Total 

16 

117 

133 



Calculate adjusted chi-square, 5 31. Estimate the 95% confidence intervals, 87% to 
99%, for the population percentage dead in the higher concentration; and 72% to 
92% in the lower. 
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EXAMPLE 9.9 — Here are data on the germination of samples from two bags of 
blue grass seed 


Bag Number 

Number 
of Seeds 
Germinating 

' i 

j 

Number Not 
Germinating 

Total 

Percent 

Germinating 

1 

340 

60 

400 

85 

2 

356 

44 

400 

89 

Total 

696 

104 

800 

87 


Calculate adjusted x 2 ~ 2 49. Since the two bags were from the same shipment, the 
difference between the percentages of germination may be attributed to sampling 
variation, and 87% considered as an unbiased estimate of the percent germination in 
the two bags. Set 0 99 confidence limits to the population percentage Ans. 84% -90%. 

EXAMPLE 9 10 — In 1933 Davis (6) was trying to determine the causes of unusual 
development of smut ( Ustilago zrac) in corn plants. Since all the plants were inoculated, 
the chances of infection seemed equal Bags were placed over the ears of 174 plants 
just before silking, thus preventing pollination and inhibiting development. Of these 
plants, 38 were smutted. Among 262 unbagged plants 31 were found smutted on 
September 15 Chi-square is 7.13, indicating a significant difference in the percentages 
of smutted plants, attributable to the effects of bagging. 

EXAMPLE 9.11 — In examining the effects of different sprays in the control of 
codling moth injury to apples, Hansbeny and Richardson (12) counted the wormy 
apples on each of 48 trees, Two trees sprayed with lead arsenate yielded. 

A: 2,130 fruits, 1,299 or 61% of which were injured 
B* 2,190 fruits, 1,183 or 54% of which were injured 

Chi-square = 21.16 is conclusive evidence that on these two trees the chance of injury 
was different. Yet they were treated in all respects alike This is characteristic of spray 
experiments. For some unknown reasons, lnjuiies under identical experimental treat- 
ment differ significantly. Hence, it is impossible to comjiare sprays on single trees, 
because a difference in percentages of injured apples might be attributable either to 
the treatments or to the unknown sources of variation. A statistical deter mination of 
the homogeneity or heterogeneity of experimental material under identical conditions 
is called a test of technique. Hansberry and Richardson combined a study of technique 
with their experiment on sprays 

EXAMPLE 9.12 — The Bureau of Entomology and Plant Quarantine, United 
States Department of Agriculture, has been investigating the effectiveness of various 
treatments for killing cabbage worm loopers. From some unpublished data suj^plied 
by Messrs. D J Caffrey and G. E. Smith (5), we have taken the records of the numbers 
of plants found infested after two treatments. 

Treatment A: 200 plants examined, 24 found infested 
Treatment B- 200 plants examined; 9 found infested 

Chi-square = 6.47 shows that these samples were not likely taken from the same 
population. Evidently the treatments caused differences in the numbers of infested 
plants, treatment B being more effective in killing the loopers. 

9.8 — Three sets of attributes — the 2x2x2 table. Not infrequently 
observations are made on more than two attributes; for example on sex, 
on treatment and on species. If there are only 2 classes of each attribute, 
independence may be tested by a method due to Bartlett (1). An example 
is found in an experiment by Gowen (11) on the susceptibility of strains 
of mice to different isolations of typhoid organisms, table 9.9. 
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TABLE 9.9 

Numbers of 2 Strains of Mice Dead and Alive 21 Days After Inoculation 
With 2 Isolations of Typhoid 



Strains of Mice 



RI 


Ba 

Isolations 

Dead 

Alive 

| Dead 

Alive 

9D 

260 

46 

322 

27 

11C 

164 

145 

336 

10 


Each of the fourfold tables may be examined for independence. In 
the Ba strain, for example, 

o (336 X 27 - 322 X 10 - 695/2) 2 (695) 

y 2 — — 7.16 

* (349) (346) (658) (37) 

Gleaily the mice in this strain do not react alike to the two isolations of the 
typhoid organism. This absence of independence has been referred to 
as interaction: chi-square tests the hypothesis that there is no interaction 
between survival and the two isolations of the organisms. 

In the theory being followed, the estimate of interaction is the ratio of 
a/b to c/d, or alternatively, the ratio of a/c to b/d * each ratio is ad/ be. 
Among the Ba mice, this interaction is (322) (10)/ (27) (336) = 0.3549. 
By comparing the interaction ratio with the expression for chi-square, 
you will sec that, aside from the continuity adjustment, if x 2 == 0 then 
ad /be = 1. In this sense, the hypothesis tested by chi-square is that in the 
population the interaction ratio is unity. 

One more observation about the fourfold table: if the 4 deviations, 
A' — m, are represented by *, we learned that |*| is the same for the 4 
classes, and that Sx = 0. Consequently, it is easy to construct a table of 
expected numbers in which the interaction ratio is 1 : 

a — x b + x 

c + x d — x 

In verification, refer to table 9 6 in which |*| = 16.4. The expected 
numbers in the 4 classes may be written, 

290 - 16.4 280 + 16.4 

190 + 16 4 240 - 16.4 

Furthermore, 

(290 - 16.4)(240 - 16.4) _ (273.6) (223.6) __ 

(280 + 16.4X190 + 16.4) ~ (296.4) (206.4) 
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It follows that x could be computed by solving the equation, 

(a - x)(d - x) = 

(6 + x)(c + x) 

which is equivalent to the equation, 

(a - x)(d - x) = (b + x)(r + x) 

If * were calculated in this way, the convenient formula for adjusted chi- 
square would be, 

x 2 = (1*1 - 0.5)* (— — + — j- + -J- + rd-} 

\a — x b + x c + x d — x) 

In the 2X2X2 table, the interaction in one fourfold part may be 
different from that in the remainder. The RI mice, for example, interact 
to the isolations in the amount, 

(260) (1 45)/(46) (1 64) = 4.997 

whereas the interaction was 0.3549 for the Ba strain. The ratio of these 
first order interactions is known as the second order interaction, 

4.997/0.3549 = 14.08 

If the" table is written in the symbolical form, 

a b a! b f 

c d c' d' 

the second order interaction is 

ad la'd' , , , , , 

7 - — = adb c /a d be = 14.08 
be b c 

In order to test the hypothesis that in the population the second order 
interaction is 1, the value of the deviation, |x|, common to all 8 cells of 
the table, is determined by solving the equation, 

(a — x)(d — x)(c f — x){b' — x) = (c + x)(b + x){a f + x)(d' + x) 

The equation is easily remembered if it is noticed that in the left member 
the letters follow a diagonal downward in the first 2X2 table, then across 
and upward in the second. Substituting the mouse data, 

(260 — x) (145— x) (336— x) (27 — x) = (164+x) (46+x)(322+x)(10+x), 

which reduces to 

1,31 Ox 8 - 113,303x 2 + 20,540,068x - 317,722,720 = 0 
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Methods for approximating the real root of this cubic may be found 
in any college algebra. Essentially, they consist in trying various values 
of x until two are found between which the left member of the equation 
changes sign. A first approximation may be got by ignoring the first two 
terms, solving the equation, 

20,540,068* - 317,722,720 = 0 

From this, * = 15 roughly. If you try 15, 16, and 17 in the cubic, you 
will find that the left member is —12,721,440 for * — 16 and 5,149,899 
for * = 17. This leads to the improved approximation, 

* - 16 _ 12,721,440 
17 - * “ 5,149,899 

Solving, * = 16.7. If a still better approximation is desired it will be 
found that the left member is —22,868 for * = 16.71 and 155,635 for 
* = 16.72: the equation, 

* - 16.71 22,868 

16.72 - * ~ 155,635 

gives * = 16.711, a more accurate approximation than is ordinarily 
needful. Verify the result by substitution in the original equation. Note: 
x may be positive or negative in other examples. 

The 8 expected numbers, (260 — *),... (10 + *), are now calculated, 
whence adjusted chi-square, with 1 degree of freedom, is 

v 2 = (16.711 - 0.500) 2 | - 1- ... 4 — I 

X V ) \ 243.289 26.711 / 

= 45.75 

This is very strong evidence that the interaction between survival and 
isolation in the RI strain is different from that in the Ba . Equally it may 
be said that the interaction between strain and survival differs with isola- 
tion; also, though it has little meaning in this illustration, the isolation- 
strain interaction is not the same for the living and dead. 

If, in other circumstances, the investigator is willing to accept the 
hypothesis of no second order interaction ( adb'c ’ = a'd'bc in the popula- 
tion) it would be appropriate to combine the data into one (or more) of 
the 3 summary fourfold tables, then test the corresponding first order 
interaction (example 9.13). 

EXAMPLE 9.13 — In studying the isolating mechanisms of two species of Droso- 
phila, Mayr, and Dobzhansky (16) confined males of D perswulis foi several days with 
females of the same species, pro-conditioning them, then gave them access to females 
of both their own species and of D. pseudoobscura They then confined persimihs males 
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with pseudoobscura females, counterconditioning them, following this with multiple 
choice as before. The numbers of females inseminated are recorded. 



Pro-conditioned 

Counter-conditioned 


j Inseminated 

Not 

Inseminated 

Not 

Perstmihs 

18 

14 

41 

6 

Pseudoobscura 

5 

32 1 

20 

32 


The equation, ( x -f 18)(.y -f- 32) Cat 4- 20) (a: 6) = ( x — 5) (at — 14) (at — 41)(* — 32), 

has the approximate root, 0.45, hence chi-square (you can’t adjust it) is small, 0.13. 

If the hypothesis of negligible second order interaction be accepted, it becomes 
interesting to examine the interaction in the condensed table. 


Females 


Inseminated 

Not 

Perstmihs 



59 

20 

Pseudoobscura 

' 


25 

64 


The adjusted chi-square value, 34.5, leaves no doubt that, under the conditions speci- 
fied, males inseminate the females of their own species more freely. 

EXAMPLE 9.14 — Under the hypothesis of independence, the probabilities in the 
4 cells of a 2 X 2 table such as table 9 5B, may be written 

p\ pi 

pi pi 

Show that pi pi = p 2 p&. Correspondingly, in the 2X2X2 table, 

p\ p2 pb p6 

pi pi pi pi 

verify the fact that p x pi p 7 = p z pip^ p t . 

9.9 — Test of independence in an R x C table. If one or both of two 
attributes are recorded in more than two classes, independence may be 
tested by chi-square with (R — 1 )(C — 1) degrees of freedom Based on 
the border totals, 2 probabilities of occurrence may be calculated for 
every cell of the table Under the hypothesis of independence, the products 
are probabilities that an observation shall fall in the several compart- 
ments, and the probabilities lead to expected numbers, m. Finally, 
X 2 = S(X — m) 2 /m. The hypothesis of independence is identical with the 
hypothesis that there is no interaction in the population. 

Strand and Jessen (18) examined the distribution of tenancy on various 
levels of soil fertility in Audubon County, Iowa. One question was, “Is 
tenancy dependent on fertility; that is, is there a tendency for some forms 
of tenancy to be prevalent on a particular fertility level?” A sample is 
summarized in table 9.10. 

Calculation of the expected numbers follows one of the patterns already 
explained. In soil 7, for example, the probability of an observation is 
152/517 = 0.294003: this, multiplied in succession by 125, 214, and 178, 
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TABLE 9.10 

Numbers of Farms on 3 Soil Perth ity Groups in Audubon County, Iowa, 
Classified According to Tenure 


Soil 

: 

■ 

Owned 

Rented 

Mixed 

Total 

I 

A' 

36 

67 

49 

152 


m 

36 75 

62 92 

52 33 



X - m 

-0 75 

4 08 

-3 33 


II 

X 

31 

60 

49 

140 


m 

33 85 

57 95 

48 20 



X — m 

-2 85 

2 05 

0 80 


III 

X 

1 58 

87 

80 

225 


m 

| 54 40 

93 13 

77 47 



X - m 

3 60 

-6 13 

2 53 


Total 


125 

214 

178 

517 


(x — m ) 2 


M>75)* 

36 75 ' 


(2 53)* 
77 47 


1 54, dj - (R - 1)«7 — 1) 


gives the expected numbers, 36.75, 62.92, and 52.33. Verification of 
the expected numbers of owned farms could be had by multiplication 
of 125/517 = 0.241779 by the totals in the right-hand border, but this 
is usually not necessary because (i) the sum of the expected numbers in 
each row and column must equal the border total, and therefore (ii) the 
sum of the deviations in each row and column must be zero: hence, verifi- 
cation is easy. From the deviations and the expected numbers, chi-square 
is calculated as indicated. Clearly, there is no reason to reject the hy- 
pothesis of independence. 

Two observations: (i) If one or more cells of the table contribute 
unusually large values of {X — mY/m , they may warrant special con- 
sideration. (ii) Since there is no evidence of interaction, it would be 
proper to test any hypothesis the investigator might have about the 
border totals. For example, if he should know that in the state as a whole 
tenancy is distributed in the ratios 25:45:30, he could test (as in section 
9.3) whether his sample is randomly drawn from the state population or 
if Audubon County has a different pattern from that in the state. 

9.10 — A special method of computing chi-square in an R x 2 table. 

In scientific writings many tables of enumeration data are presented in 
some such fashion as the first three columns of table 9.11. These data 
are selected from an experiment in which uniform doses of Danysz 
bacillus were injected into rats. The sizes of the subsamples were dictated 
by the numbers of animals available at the several dates of injection. 
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TABLE 9.11 

Percentage of Rats Living After Injection With Danysz Bacillus 


Number 
in Group 

Number Living 

X 

Probability * 
of Survival 

P 

Products 

px 

40 

9 


0.2250 

2 0250 

12 

2 


0 1667 

0.3334 

22 

3 


0 1364 

0 4092 

11 

1 


0 0909 

0 0909 

37 

2 


0 0541 

0 1082 

20 

3 


0.1500 

0 4500 

142 

SX = 20 

p = 20/142 

* 0 1408 

SpX - 3.4167 



9 = 1 ~ P 

« 0 8592 



2 _ SpX - pSX _ 3.4167 - 2 8160 
: pq “ (0.1 408) (0.8592) 


4.97, dj. « 5, P - 0.43 


* If you prefer percentages, see example 9.18. 


The probabilities of survival, p, in the third column of the table are 
emphasized. For such tables a convenient method of computing chi- 
square has been devised (17). (This is sometimes referred to as the 
Brandt and Snedecor method). The results, of course, are the same as 
those obtained by applying the method of the foregoing section. The 
second and third columns might as well refer to the numbers dead, but 
computation is easier with the smaller probabilities. In the last column 
the sum of the products is usually accumulated in the dials of a calculating 
machine with no listing of the separate items. 

The results indicate that the probability of death is independent of the 
group and may therefore be characteristic of the amount of inoculum 
injected. The experimental technique was rather difficult to control — 
the rats were probably variable in their reaction to the injected organism, 
the organism doubtless changed somewhat in virulence, and the amounts 
of the inoculum introduced were likely not entirely uniform — yet the 
probability of death throughout the sample was no more divergent than 
would be expected in groups randomly drawn from a population with 
constant probability of death. 

This special method may be convenient if crossing over is suspected 
in genetic material which should theoretically segregate r: 1. For the sake 
of easy comparison, let us apply it to the data of table 9.1, rearranged as 
in table 9.12. The new features are in the lower part of the table where 
chi-square is computed on two assumptions: (i) that the population ratio 
is 3:1, the value of chi-square being identical with the interaction chi- 
square calculated as a difference in section 9.2; and (ii) that there is no 
theoretical ratio. What are the advantages? 

First, since x 2 = 3.08, d.f. = 10 is so small as to indicate no heteroge- 
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TABLE 9.12 

Segregation of Green and Yellow Seedlings in 11 Progenies. Data of Table 9.1 
Alternative method of computation 


Number 

Examined 

Number 

Yellow 

X 

Probability 
of Yellow 

P 


Products 

px 

122 

24 

0.1967 


A 7208 

149 

39 

0 2617 


10 2063 

86 

18 

0 2093 


3 7674 

55 

13 

0 2364 


3 0732 

71 

17 

0 2394 


4 0698 

179 

38 

0 2123 


8 0674 

150 

30 

0 2000 


6 0000 

36 

9 

0 2500 


2 2500 

91 

21 

0 2308 


4 8468 

53 

14 

0 2642 


3 6988 

111 

26 

0 2342 


6 0892 

1,103 

SX - 249 

p = 0 22575 

SpX - 

56 7897 



<7= 0 77425 

pSX - 

56 2118 



SpX 

- psx - 

0 5779 


Assuming 3:1 ratio, p = 0.75, q — 0 25 

2 _ SpX - pSX __ 0.5779 

* ~p q ' (0 75) (0 25) 

With no assumption about ratio, 


3 08 


2 * s p x ~ v^L . _ L 577 ? , „ 

x jiq (0 2258) (0.7742) " 


neity among the sample ratios, the pooled chi-square is of interest and may 
conveniently be calculated by means of the formula (section 1.15), 

, = ( « ~ rb'f = (854 - 3 X 249)» = 

X r(a + b ) 3(1,103) 

We now have both the pooled and the interaction chi-squares which are 
essential: the total chi-square may be got by addition if it is needed. 
The calculations are notably less burdensome than those of table 9.1. 

Second, we have as before the evidence to decide whether there is 
crossing over or not. While pooled chi-square is short of the 5% point, 
it does emphasize a bias — a deficit in the recessive yellows. If the investi- 
gator attributes this to crossing over (or to some other genetic phenome- 
non) he will reject the 3:1 hypothesis. He then has available the appro- 
priate interaction x 2 — 3 31 based only on the assumption of indepen- 
dence, and this with a negligible amount of extra computation. 

Fisher and Mather have made further adaptations of this method to 
test homogeneity in data with hierarchical subdivisions (8) (9). 
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EXAMPLE 9.15 — In another sample of inoculated rats the data were: 25, 21; 
50, 48; 20, 15; and 20, 17 — the first number in each pair being the subsample size, the 
second, the number dying. Calculate x 2 ” 6.69, P = 8.6%. 

EXAMPLE 9.16 — Combine the values of chi-square for the 2 samples of rats, 
yielding x 2 - 11.66, </./.= 8, JP « 17%. 

EXAMPLE 9.17 — If you are willing to accept the hypothesis that each sample of 
rats was drawn from a homogeneous population, it is appropriate to test the hypothesis 
that the probabilities of death in the twojpopulations are the same; that is, that the 
two sample probabilities, 85.9% and 87.8%, differ only as may be expected in random 
sampling from a common population. Ans. Adjusted x 2 — 0 070 (section 9.7). 

EXAMPLE 9.1 8-- Apply the method of table 9.12 to the data of example 9.1, 
using percentages instead of probabilities. Note the slight modification in the formula. 


Number 

Number Yellow 

Percentage Yellow 



in Group 

X 

P 



139 

41 

29 4964 * 



102 

31 

30 3922 



179 

52 

29 0503 



86 

25 

29 0698 



122 

36 

29 5082 



628 

SX = 185 

V * 29 4586 

SpX = 5,451 

17 



q = 70 5414 

pSX - 5,449 

84 




1 

C/^ 

33 


For testing interaction fiom 5 1, x 2 = \j 5 y — 0 071 

tj. > , . 2 K)0(Sj>X - pSX) 

For testing independence, X' - -- — . = 0 064 

M 


* I carried 2 extra digits in ordci to verify the result in example 9.1. 


Since we know from the earlier computations that pooled chi-square is significant 
at the 1% point, some such bias as crossing over may be decided upon. If so, the 3*1 
hypothesis is abandoned and interaction calculated as 0.064 Clearly, interaction is 
negligible under either hypothesis — 3’1 or independence. 

EXAMPLE 9.19 - Burnett (4) tried the effect of five storage locations on the 
viability of seed corn. In the kitchen garret, 111 kernels germinated among 120 
tested, in a closed toolshed, 55 among 60, in an open toolshed, 55 among 60, out-of- 
doors, 41 among 48; and in a dry garret, 50 among 60. Calculate x a = 5 09, df. = 4, 
P — 28%. The calculations are somewhat shorter if the numbers of nongerminating 
seeds are used The results contain slight evidence against the hypothesis that there is 
a uniform probability of germination, irrespective ol location. 

EXAMPLE 9 20 -- Among some experiments reported by Lindstrom (14), the num- 
ber of ears of corn displaying the presence of a recessive gene producing sugar was 18 
among 33 having 8 rows of kernels, 37 among 63 with 10 rows, 27 among 70 with 12 
rows, and none among 4 with 14 rows. It is easy to see that the expected number of 
sugary ears among the last four is less than five, the minimum expected number that 
should be used in applying chi-square. The 12-row and 14-row ears, may, therefore, 
be merged into one subsample having 27 sugary ears among 74. If this is done, 
X 2 es= 7.4, d /. = 2, and P = 3%. The evidence is against homogeneity, though not 
strongly so. Apparently the sugary gene is associated with ears having few rows of 
kernels. 



Section 9.11 Attributes : Multiple Degrees of Freedom 


209 


EXAMPLE 9.21 — Using the data of the preceding example, calculate chi-square 
by means of the formula of section 9.9. Since 48 24% of the total 170 ears were sugary, 
one would expect 48.24% of the 33 eight-row ears to bear the gene; that is, 15.92 ears. 
The deviation from expected is 18 — 15.92 = 2.08 ears. Similarly, five more devia- 
tions from expected are calculated (3 pairs in each of which the deviations are equal 
in absolute value but opposite in sign), the resulting chi-square being 7.4 as before. 

9.11 — An experiment requiring a test of technique. When novel 
experimental methods are used, the investigator must determine, among 
other things, whether he can reproduce his results; that is, whether he 
has adequate control over the conditions under which the experiment is 
performed. Reference to this problem was made in the section pre- 
ceding. If results cannot be verified under controlled conditions assumed 
to he identical, then it is idle to try the effect of changing these condi- 
tions — one cannot know whether differences (or likenesses) in the results 
are to be charged to the controlled situation or to those unknown causes 
that elude control. 

Decker and Andre (7) were faced with this problem when they started 
to investigate the effect of a short, sudden exposure to cold, the adults 
of the chinch bug being the organism studied. The experimental insects 
had to be gathered in the field, so that the degree of heterogeneity was 
unknown. Ten individuals were placed in each of 50 tubes and exposed 
for 15 minutes at —8° G. Since the chi-square method may not yield 
accurate results unless the expected numbers are all at least as great as 
five, the counts of the numbers dead in the individual tubes were com- 
bined at random in 5 lots of 10 tubes each; that is, lots of 100 nymphs. 
The mortalities were 14, 14, 23, 17, and 20 insects. From these data* 
X 2 == 4.22, dj . = 4, and P = 39%. This indicates about the variation 
that would be expected in sampling from homogeneous material. The 
mean probability of death was 17.6%. The results are in accord with the 
hypothesis that every individual was subject to this same chance of being 
killed by the exposure. 

In a second sample of 500 adults, handled in the same manner except 
that they were exposed at —9° C., the mortalities in groups of 100 were 
38, 30, 30, 40, 27. The chi-square of 5 79 verifies the technique and shows 
only sampling variation from the estimated mortality of 33%. While 
no particular advantage is gained in this case, it is interesting to add the 
two values of chi-square, x 2 = 4.22 + 5.79 = 10.01, d.j. — 8, P = 27%. 

The gratifying uniformity in the experimental results leads one to 
place confidence in the rather surprising finding that the mortalities 
at — 8°C. and — 9° C. were significantly different. The total numbers 
dead in the two samples of 500 were 88 and 165. The value, % 2 — 31.4, 
dj. = 1, p less than 0.0002 (section 8.8), constitutes convincing evidence 
that a rise in mortality with the lowering of temperature from — 8° C. 
to — 9° G. is a population characteristic, not merely an accident of 
sampling. 

The ease of applying such a test of experimental technique makes its 
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use almost a routine procedure except in highly standardized processes. 
It is necessary merely to collect the data in several small groups instead 
of in one mass. The additional information available may modify con- 
clusions profoundly, 

EXAMPLE 9.22 — Dr. R. H. Walker of the Intermountain Forest and Range 
Experiment Station has provided the following unpublished data on counts of bacteria 
in suspensions of soil and indigo particles, following the method of Thornton and Gray 
(19). The number of indigo particles in a field specifies the amount of soil present. 
Good technique will be indicated if the numbers of bacteria vary from proportionality 
to the indigo particles only so much as would be expected in random sampling from a 
population in which this ratio is constant On a slide, 16 fields were counted, four in 
each of four drops. The data are records of the total slide counts, four slides to each 
soil suspension. For purposes of computation, we shall give only two figures: (i) the 
total count of both bacteria and indigo particles, and (ii) the number of bacteria. 
For the four plates of soil suspension 1: 204, 78; 260, 75; 246, 76; 278, 95. Chi-square 
« 5.22, d.f. = 3 , /> = 16%. For suspension 2: 260, 60; 196, 50; 198, 45; 186, 50 
Chi-square *■ 1.30, dj . — 3, P = 73%. For suspension 3: 177, 22; 177, 23; 150, 16; 
177, 20. Chi-square = 0.53, df. = 3, P ~ 91%. For suspension 4* 289, 46; 356, 63; 
281, 45; 250, 42. Chi-square = 0.48, d.f. — 3, P = 92%. The agreement is closer 
than one would ordinarily expect in random sampling. 

EXAMPLE 9.23 — To determine whether the four soil suspensions differed sig- 
nificantly in the numbers of bacteria per unit volume, the totals are taken from the 
records of the foregoing example, for suspension 1*988, 324, 2 840, 205; 3.681, 81, 
4*1176, 196. Chi-square = 130, df. ** 3, P less than 1% There is no doubt of differ- 
ences among the four soil suspensions. 

EXAMPLE 9.24 — In computing chi-square by the method of table 9 11, the 
numbers of observations in the groups may be either equal or different If they are 
equal, the mathematician can easily reduce the formula to the special one x 2 “ 
kSx 2 /£(k — x), where k is the size of the subsample, Sx 2 is the sum of the squares of the 
deviations from mean, £, obtained in the usual manner by using the formula, Sx 2 

SX 2 — (SX) 2 /n. Those who work extensively with chi-square wall find some saving 
in time by using this special formula when their group numbers arc equal. They may 
practice by applying it to the Decker and Andre data. This formula w r as fust used 
by i£ Mathctes” (15). 

9.12 — An experiment in which the presence of an attribute is 
recorded quantitatively. In the enumeration data heretofore considered, 
the mere presence of some attribute was recorded. There is another kind 
in which presence may be observed in more than one unit, so that the 
record becomes quantitative even at this initial stage. Both kinds are 
employed in many studies of insect populations. In investigations of the 
intensity of attack by codling moth, for example, the fruits may be desig- 
nated qualitatively as merely injured or free; or they may be described 
as containing 0, 1, 2, etc., larvae. The latter kind of data often follows 
a distribution known as the Poisson, further account of which will be 
given in chapter 16. An example is found in an experiment designed to 
investigate various treatments for the control of cabbage loopers (5). 
Certain parts of the data are summarized in table 9.13. The original 
record of the counts on the individual plants in plot 2 of treatment four 
was like this: 0, 1, 0, 0, 1, 2, 1, 0, 0, 0, 0, 4, 0, etc. In the four plots receiv- 
ing one of the treatments not entered in the table only a single looper 
was found. For this illustration we selected only those treatments yielding 
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TABLE 9.13 

Number of Loopers Observed on 50 Cabbage Plants in a Plot 
Four plots treated alike. Five treatments 


Treatment 

Number of Loopers 
in Each of 4 Plots 

X 

Total for 
Treatment 

SX 

Mean 

£ 

Chi-square 

Sx */i 

1 

11, 4, 4, 5 

24 

6 00 

5.67 

2 

6, 4, 3, 6 

19 

4.75 

1 42 

3 

8, 6, 4, 11 

29 

7.25 

3 69 

4 

14,27, 8,18 

67 

16.75 

11 39 

5 

7, 4, 9, 14 

34 

8.50 

6 24 

Total 


173 


28 41 


Computation Jot treatment 1 

A** — SX* — (SXy/n = (11)* + (4)* + (4)* + (5)* - (24)*/4 = 34 
x* = Sx*/£ - 34/6.00 = 5.67, d.f. » 3, P = 14% 


Compulation for Fine Treatments 

Sx* - SX* - (SX)Vn = (24)* + (19)* + (29)* + (67)* + (34)* - (173)*/5 = 1437.2 

t = 173/5 = 34.6 

X 2 = AV*/f - 1437 2/34 6 - 41.5, df. = 4, P leas than 0.0002 


at least five as the expected number for each plot. On plots with less 
loopers there was even greater uniformity than on those selected for this 
investigation. 

For treatment 1, the expected number in each plot is simply the mean 
of the four plot counts, 6 loopers. The deviations from expected are, 
therefore, just the same as deviations from mean in measurement data, 
11 — 6, 4 — 6, etc. To make up chi-square, each of these deviations must 
be squared, then divided by the expected number. Since the latter is the 
same for the four plots, division may be delayed until the addition has 
been completed. Thus, 

„ (5) 2 + (-~2) 2 4- ( — 2) 2 + f — 1) 2 34 

* ‘ 6 7 “ 5 67 

In the table, the computation is done by the usual machine method. It 
is easy to see that the formula used is merely an adaptation of the cus- 
tomary formula for chi-square. 

For treatment 1, the value of chi-square, computed in the table, would 
be exceeded 14 times in 100 trials from homogeneous material. The 
uniformity of infestation need not be questioned. The sum of the chi- 
squares for the five treatments is 28.41, d.f. = 15, P = 2%. This indi- 
cates significant variation from uniformity of infestation among plots 
treated alike. From the original records, this lack of homogeneity seemed 
to be confined chiefly to the plots more severely attacked. However, the 
departure from uniformity appears to be slight compared with the varia- 
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tion induced by the treatments. This is shown by the last computation in 
the table where five treatment totals are examined. The chi-square of 
41.5, d.j. = 4, leaves little doubt of the treatment effects, even though an 
unknown amount of variation may have been introduced by the hetero- 
geneity of the plots treated alike. 

EXAMPLE 9.25 — Davis (6) counted the number of nodal smut galls in the corn 
plants of eight plots. The plants of four plots had been inoculated with the organism 
producing the galls. The number of galls observed m these plots were 56, 60, 41, 
and 75. Chi-square = 10.1, df — 3, P = 2%. In the four plots whose plants were 
not inoculated, the numbers ol galls were 20, 12, 26, 22. Chi-square = 5.2, d.f. = 3, 
P ~ 17%. For the two groups, x 2 — 15 3, d.f. — 6, P = 2% These figures indicate 
a significant lack of uniformity among the plots treated alike. Nevertheless, since the 
heterogeneity is not great, it is proper to test the difference between the totals for the 
two groups of plots. For the sums, 232 and 80, x 2 = 74, df. = 1, Pless than 1 %. There 
is little question of a population difference between the totals. 

EXAMPLE 9.26 — In example 9.22, soil suspension 1, ignore the numbers of indigo 
particles and compute, for the bacteria alone, x 2 — 3.3, df. = 3, P ~ 36% Neither 
the bacteria nor the ratios of bacteria to totals depart significantly from sampling 
variation in a homogeneous population. The ratios, as shown by the larger chi-square, 
are more variable, possibly because the test is more critical; the former test was based 
on the variation of both bacteria and indigo particles. 

EXAMPLE 9.27— -Davis (6) counted the numbers of cars on two plots of corn whose 
plants had been inoculated with an organism producing smut, and also on two check 
plots The numbers were 436 and 432, respectively The obvious failure of the inocula- 
tion to affect the number of ears is verified by unadjusted x 2 “ 0-02, df. = 1. The 
numbers of smutted ears, however, were 69 and 39. That the percentages differ 
significantly is shown by adjusted x 2 = 8 59, df. — 1. 

9.13 — Summary. The methods of chapter 1 have been extended to 
more than 2 classes (e.g., the 9 : 3 : 3 : 1 genetic ratio, the 2X2 and the 
2X2X2 tables) and to 2 or more samples. These samples may be from 
populations with or without theoretical ratios. The Poisson distribution 
has also been introduced. 

If 2 or more samples are drawn from populations with theoretical 
ratios, 3 different hypotheses may be tested: (i) that the pooled sample 
is from the specified population, (ii) that the several samples may all 
have come from the specified population, and (iii) that there is no inter- 
action. 

In the 2 X 2 table, the hypothesis tested by chi-square may be stated 
in 3 ways: (i) the two attributes are independent, (ii) there is no inter- 
action in the population; that is, ad = be. (iii) pairs of ratios such as 
a / ( a + b) and c/ (c + d) are random samples from a common population 
ratio — the hypothesis of homogeneity. In the 2X2X2 table, the chi- 
square test of independence also tests the hypothesis of no second order 
interaction, adb'c ' = a'd'bc. 

In the R X C table, the hypothesis tested by chi-square is that of inde- 
pendence (or homogeneity) or that of absence of interaction. 

A special method is given for computing chi-square in an R X 2 table 
of probabilities or percentages. At the end, there are available chi- 
squares for testing interaction under either of 2 hypotheses, (i) that the 
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population has a theoretical ratio or (ii) that the attributes are inde- 
pendent. 

Computing methods are described for samples from Poisson distri- 
butions. 
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Chapter 10 


TWO OR MORE GROUPS OF MEASUREMENT DATA. 
ANALYSIS OF VARIANCE 


10.1 — Extension from two groups to many. Statistical and experi- 
mental methods for two groups were presented in chapter 4, but the 
needs of the investigator are seldom confined to only two treatments. 
More than two require extensions of the methods. For enumeration data 
such extensions were made in the foregoing chapter. We are now ready 
to do the same for measurement data, expand chapter 4 to allow for the 
simultaneous trial of three or more treatments. 

During the early 1920’s great progress was made in handling the 
statistics of multiple groups (2). Near the beginning of the decade 
Fisher (3) was able to solve some problems of distribution that completed 
his analysis of variance . This device has led to tremendous expansion in the 
design of experiments, each with an appropriate statistical analysis. Some 
of the simplest of these form the topic of the present chapter. 

10.2 — Multiple samples from a common population. Analysis of 
variance. In the earlier sections of chapter 4 we considered two groups 
of equal size: three or more such groups now concern us. As examples, 
there may be several lots of animals, every lot receiving a different ration; 
or several classes of children in the 6th grade on which different methods 
of instruction are being tried. The statistical analysis of the ensuing ex- 
perimental data is most easily explained by reference to the sampling of 
chapter 3. There we drew at random a number of groups from a normally 
distributed set of pig gains with mean, 30 pounds, and variance, 100. 
If we put together several of these groups, say 4, we shall have in effect 
an experiment with 40 pigs divided into 4 lots of 10 animals each. Ordi- 
narily the lots would receive different treatments, but our data simulate 
a dummy experiment , or uniformity trial , in which all lots are treated alike. 

The results of an experiment of the kind just described are recorded in 
table 10.1. For ease in calculation the lots have been reduced to 5 pigs 
each and the gains in weight changed slightly so as to avoid fractions. It 
is convenient to refer to the entire experiment as a sample: this is divided 
into n subsamples (lots or groups) each with k items. Thus, the experiment 
is comprised of nk individuals. 

The entire sample of 20 weight gains has the mean 29 pounds, an 
estimate of m = 30 pounds. The variance, 1,918/19 = 100.9, estimates 
cr 2 = 100. These statistics could be used, if desired, to set fiducial limits 
or to test hypotheses in the manner of chapter 2. 
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TABLE 10.1 

Gains in Weight (Pounds) of 4 Lots of Swine, 5 Per Lot. Sample From Table 3.1 


k *= 5 Pigs 
per Lot 


n = 

4 Lots 


Entire 

Sample 

1 

2 

3 

4 

1 

40 

29 

11 

17 


2 

24 

27 

31 

21 

, 

3 

46 

20 

17 

28 


4 

20 

39 

37 

33 


5 

35 

45 

39 

21 


SX 

165 

160 

135 

120 

580 

2 

33 

32 

27 

24 

29 

SX* 

5,917 

5,516 

4,261 

3,044 

18,738 

(SX)*/k 

5,445 

5,120 

3,645 

2,880 

16,820 

*Sx* 

All 

396 

616 

164 

1,918 


* This method of calculation is explained in section 5.2. 


A second estimate of the population variance is got from the sums of 
squares within the 4 subsamples. Following the method of chapter 4, 
these sums of squares are added or pooled : 

472 + 396 + 616 + 164 = 1,648 

What about degrees of freedom? Since the mean of each lot is used to 
measure deviations from, one degree of freedom is deducted from the 
original 5. Altogether, then, there are 4 d.f. in each of the 4 subsamples, 
these 16 corresponding to the sum of squares, 1,648. The resulting mean 
square, 1,648/16 = 103, is the second estimate of a 2 furnished by the 
sample. 

The lot means lead to a third estimate of the population variance. 
Because these 4 means are random samples from the same normal popula- 
tion, their variance is an unbiased estimate of <r 2 /k = 100/5 = 20. The 
variance of these means is 

(33 - 29) 2 + (32 - 29) 2 + (27 - 29) 2 + (24 - 29) 2 _ 

_ _ 18 

instead of 20. Since 18 is an estimate of <r 2 /5, then 5 X 18 = 90 is the 
third estimate of <r 2 , this one based on the subsample means with 3 degrees 
of freedom. 

The foregoing analysis is summarized in table 10.2. One additional 
feature: the variance of lot means has been multiplied by its degrees of 
freedom to form a sum of squares, 270, the utility of which will be clear 
from what follows. 

The table brings into prominence two notable facts: not only the total 
sum of squares but also the corresponding degrees of freedom have been 
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TABLE 10.2 

Analysis of Variance of Pig Gains in Table 10.1 



Degrees of 

Sum of 

Estimates 

Source of Variation 

Freedom 

Squares 

of a 2 

Individuals of the several lots 

16 

1,648 

103 

Lot Means 

3 

270 

90 

Total 

19 

1,918 

100.9 


separated into two parts, both associated with the structure of the experi- 
ment. Since the parts as well as the total were computed independently, 
their additive relationship is emphasized. You will recall similar addition 
theorems in section 6.12. This partition of degrees of freedom and cor- 
responding sums of squares, each part providing an estimate of variance, 
is called analysis of variance. 


EXAMPLE 10.1 — Analyze the variance in the following data selected for case in 
computation. 


Observation 

Number 


Subsample Number 
2 3 4 


1 

2 

3 


11 

13 

21 

10 

4 

9 

18 

4 

6 

14 

15 

19 


Analysis of Variance 


Source of Variation 


Degrees of 
Freedom 


Sum of 
Squares 


Mean 

Square 


Individuals 

Subsamples 


8 172 21 5 

3 186 62 0 


Total 


11 358 32.5 


EXAMPLE 10.2 — Calculate the sum of squares for subsamples, 186, by adding 
the four subsample correction terms, 147, 432, etc., then deducting the sample correc- 
tion term, 1,728. 

EXAMPLE 10.3 — Calculate the sum of squares for subsamples as follows: 

(21)* + (36)* + (54)* 4- (33)* (144)* 

3 12 

The numbers squared in the numerator are the subsample and sample totals, while 
those in the denominators are k and nfc. 

Which method of calculation seems easiest? 
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10.3 — Calculation of analysis of variance. Samples from different 
populations. One of the reasons for the popularity of analysis of variance 
is the elegance of its computations. Those in table 10.1 may be shortened 
considerably as soon as their meaning is learned. As a preliminary, note 
that, in the last line but one, the sum of the four correction terms, corrected 
by the one in the last column, is equal to the sum of squares for subsample 
means: 

5,445 + 5,120 + 3,645 + 2,880 - 16,820 = 270 
This obviates the separate calculation of the variance of the means and 
its subsequent multiplication by k and n — \. 

With adequate facilities for checking calculations, most people prefer 
to omit the last three lines of the central section of table 10.1, calculating 
the two sums of squares, “total” and “lot means,” then subtracting for 
“individuals.” This is feasible because the sums, SX, instead of the cor- 
rection terms, (SX) 2 /k, can be used in calculating the sum of squares for 
lot means, thus: 

. (1 65) 2 + (160)2 + 035)1 + (120)2 (580) 2 

— -- 2/0 

5 20 

The only change is that division by k takes place after the squares are 
added instead of before. The form is easily remembered because the 
denominator in each pai t is the number of individuals, k or nk, the square 
of whose sum is in the numerator. 

To illustrate this abbreviated calculation, and also to introduce an 
experiment in which the subsamples may not be drawn from a common 
population, we use the data in table 10.3. During cooking, doughnuts 
absorb fat in various amounts. Lowe (9) wished to learn if the amount 
absorbed is characteristic of the fat used. Six mixes, each containing 
24 doughnuts, were cooked in each of 8 fats. The numbers in the table 
are the grams of fat absorbed per mix. The mix means, unnecessary in 
the computations, should always be recorded in the table. 

The two necessary parts of the computation are (i) the total sum of 
squares, 9,193, in the righthand column, calculated by summing the 
X 2 's over the whole table, ignoring groups, then correcting for mean in 
the usual manner; and (ii) the sum of squares of subclass (fat) means, 
3,527, got from the subclass sums as described above., The remainder of 
the calculations follow immediately from these two sums of squares. 

The beginner would do well to sum the squares, &Y 2 , in each subclass 
separately, then add these 8 sums to verify that at the right, 262,364. 

A symbolical summary of the calculations, table 10.4, may be helpful 
to some readers. 

If no calculating machine is available, proceed as follows: (i) Code 
each subclass separately by subtracting a convenient number about the 
size of its mean, then calculate its Sx 2 . The sum of n such is the sum of 
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TABLE 10.3 

Grams of Fat Absorbed by Mixes of 24 Doughnuts 



Fat Number 


Row No. 

1 

2 

3 

4 

5 

6 

7 

8 

i 

Total 

1 

164 

173 

177 

178 

163 

175 

178 

155 


2 

172 

161 

183 

191 

165 

193 

146 

166 


3 

168 

190 

197 

197 

144 

178 

141 

149 


4 

177 

180 

169 

182 

177 

171 

150 

164 


5 

156 

197 

179 

185 

165 

163 

169 

170 


6 

195 

167 

187 

177 

176 

176 

182 

168 


*SX 

432 

468 

492 

510 

390 

456 

366 

372 

3,486 

*sx* 









262,364 

*C = (, sxy/nk 









253,171 

Sx* 









9,193 

£ 

172 

178 

182 

185 

165 

176 

161 

162 

172 6 


Fats: 


(432) a + (468)2 + . . . -f (372) 2 _ 253 m = 1 ,540,18 8 _ 


Analysis of Variance 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Total 

47 

9,193 


Fats 

7 

3,527 

503 9 

Mixes within fats 

40 

5,666 

141 6 


* In the calculation of these 3 lines, the digit 1 in the first place is ignored; that is, 
the data are coded by subtracting 100. 


squares for mixes, 5,666. (ii) Code the 8 subclass means by subtracting 
a suitable number, say 170, calculate their sum of squares, then multiply 
by k. This yields the sum of squares for subclasses, 3,527. If desired, the 
total sum of squares may be got by adding the other two, but there is little 
occasion for computing it. 


10.4 — Test of a null hypothesis. The variance ratio, F. The dough- 
nut data of table 10 3 introduce a familiar question: is ordinary random 
sampling accountable for the large discrepancy between the mean squares 
for fats and mixes, or shall we conclude that the fat means are differen- 
tiated by causes other than sampling fluctuations? Lowe suspected that 
the nature of the several fats might result in different amounts of absorp- 
tion. If so, the mixes are samples from different populations instead of a 
common one. Even if these populations are all assumed to be normally 
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TABLE 10.4 

Symbolical Summary of Computations in Analysis of Variance 
Single criterion of classification; n subsamples, each with k items. 
Total number of items in sample =* nk 


The data 

Group 1 

Group 2 . . . 

Group p . , 

, . Group n 


** 


x n 

k items 

k items 

k items 

k items 

Sums: 5*1 

s t 

S, 

■s; 


The computations 


1. The sum of all items: S & -f & ■+■ . . . -f $n 

2. The correction term: C = S*/nk 

3. Total sum of squares: SX 2 — C 

4. Groups: (Si* + &* + ...+ S n *)/k - C 

5. Individuals: total — groups 

Degrees of freedom 

Total nk — 1 

Groups n — 1 

Individuals nk — n — n(k — 1) 


distributed, they may have different means or different variances or both. 

One null hypothesis that can be tested is that all the mixes are random 
samples from the same normal population, as were the lots of table 10.1. 
For evidence about ordinary sampling variation under this hypothesis, 
we go back to the drawings from table 3.1. Suppose the original groups 
of 10 gains are now combined into samples of 10 groups as in table 10.5. 
The analysis of variance of the entire 1 00 items shows that the ratio of the 
two independent estimates of variance, group means and individuals, is 
172.3/ 1 14.0 = 1.51 and we know that sampling variation accounts for 
the failure of these two estimates to be equal. It is necessary next to learn 
the magnitude of variation ordinarily encountered in such ratios. 

The ratio of the two estimates of variance, 

Mean square of group means 
Mean square of individuals 

has a distribution discovered by Fisher. I named it F in his honor (14). 
Fisher tabulated the distribution in the form, z = log e \/F (4). Fisher 
and Yates (5) designate F as the variance ratio , while Mahalanobis (10), 
who first calculated it, called it x . 

The distribution of F in 100 samples like that of table 10.5 is shown in 
table 10.6. One notices first the skewness: a concentration of small 
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TABLE 10.5 

Ten Groups, Each With 10 Items, Randomly Drawn From the Normal Distribu- 
tion of Table 3.1. This Simulafes a Dummy Experiment With 10 Lots of Pigs 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

35 

34 

17 

39 

18 

7 

33 

42 

7 

39 

42 

38 

29 

34 

7 

22 

29 

35 

48 

34 

42 

26 

30 

22 

31 

40 

25 

31 

43 

33 

30 

17 

36 

27 

41 

29 

27 

33 

53 

33 

15 

42 

41 

42 

21 

31 

21 

53 

7 

33 

31 

28 

30 

33 

17 

41 

46 

21 

33 

39 

29 

35 

3 

24 

21 

30 

19 

41 

17 

36 

29 

33 

23 

36 

40 

12 

43 

29 

57 

32 

17 

16 

38 

29 

14 

44 

22 

34 

42 

32 

21 

40 

30 

25 

14 

30 

21 

49 

42 

30 


Analysis of Variance 


Source of Variation 


Degrees of Freedom Mean Square 


Group means 
Individuals 


9 172 3 

90 114 0 


F = 172 3/114 0 - 1 51 


values and a long tail of larger. Next, there is the fact that 65 of the F s 
are less than 1. If you remember that both terms of the ratio arc esti- 
mates of <r, you may be surprised that 1 is not the median. However, 
the mean, calculated as in section 8.3, is 0.96, about the average one would 
expect. Finally it is to be observed that 5% of the values lie beyond 2.25 
and 1% beyond 2.75, so that these points are roughly comparable with 
those of the theoretical distribution next to be considered. 

Table 10.7 contains the theoretical 5% and 1% points of F for con- 
venient combinations of degrees of freedom. Across the top of the table 
is found ti\ dj. corresponding to the number of subsamples: ri\ = n — 1. 
At the left is n 2 , the degrees of freedom for individuals, n(k — 1). To 
find the 5% and 1% points for table 10.5, look in the column headed by 

TABLE 10 6 

Distribution of Fin 100 Samples From T\bie 3.1. Degrees of Freedom 9 and 90 


Class Interval Frequency 


Class Interval Frequency 


0 

-0 

24 

7 

0 

25-0 

49 

16 

0 

50-0 

74 

16 

0 

75-0 

99 

26 

1 

00-1 

24 

11 

1 

25-1 

49 

8 


1.50-1 74 5 

1 75-1 99 2 

2 00-2 24 4 

2.25-2 49 2 

2 50~2 74 2 

2 75-2 99 1 
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7i\ = 9 and down to the row n% = 90. The points are 1.98 and 2.62, 
halfway between the values for 80 d.f. and 100. In order to check the 
sampling distribution more exactly, I went back to the original calcula- 
tions and found 8% of our F s beyond the 5% point and 2% beyond the 
1%. This gives some idea of the variation to be encountered. 

For the doughnut experiment, the hypothesis set up — that the mix 
data are random samples from a common population — may be judged by 
means of table 10.7. From the analysis of variance in table 10.3, 

F = 503.9/141.6 = 3.56 

For ni = 7 and n 2 = 40 the 1% point in the table is 3.12. Thus there is 
less than one chance in 100 of drawing a sample having a larger value 
of F from the distribution specified in the hypothesis. Evidently the sub- 
samples come from different populations. 

The conclusion just reached may mean that each of the 8 fats con- 
stitutes a separate population of fat absorption, or it may mean that 
some of the fats belong to one population and some to another. There 
might be any number of populations from 2 to 8. The analysis of variance 
does not provide information on this point: there is no way of distin- 
guishing the subsamples of one population from those of another. How- 
ever, the fat means, each an unbiased estimate of its population mean, 
contain available evidence. Fat 4 ranked highest in absorption by the 
doughnuts, followed closely by 3. Fats 7, 8, and 5 were lowest. Further 
evidence is found in the fiducial interval. If 141.6 be accepted as an esti- 
mate of a common variance in all the populations, with 40 degrees of 
freedom, the 95% semi-interval is t 0 hSz — (2.021) x/141.6/6 = 10 pounds 
(9.82, more exactly). Thus, m for fat 4 may be around 185 — 10 = 175 
pounds, and that for fat 7 may be near 161 + 10 = 171 pounds. You 
see, it is unlikely that these two fats have the same population mean. As 
for fats 3 and 8, their m * s might both be 172, but such a coincidence is 
improbable. Now, this is about all that can be learned from the fiducial 
interval, because fats 2 and 5 may well have the same m. It seems fairly 
clear that fats 3 and 4 may belong to one population while 7 and 8 belong 
to another. There may be a population with an intermediate mean, 
and this may include fats 1, 2, and 6. What the experiment tells us 
certainly is (i) that the 8 subsamples are probably npt drawn from a 
common population, and (ii) that whatever population means are in- 
volved are estimated in an unbiased manner by the subsample means. 

As you progress in your study of experimental design you will find that 
experiments can be set up to test simultaneously a whole array of null 
hypotheses. The simple structure of the doughnut experiment cannot be 
expected to support more than the single test described. 

Three remarks close this section, (i) The F distribution is based on ran- 
dom sampling from normally distributed populations, but it has been 
found (12) that little bias is introduced into the test of significance by 
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TABLE 10.7 — 5^ c (Roman Type) and 1% (Bold Face Type) Points for the Distribution or F 
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The function, F ~e with exponent 2 a, is computed in part from Fisher’s table VI (7). Additional entries are by interpolation, mostly graphical. 
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moderately skewed distributions, (ii) Precautions about conduct of the 
experiment are the same as those in section 4.3. Some pertinent evidence 
will be presented in section 10.9. (iii) It is usual to denote a probability 
between the 5% and 1% levels by an asterisk; beyond the 1% level, by a 
double asterisk. See tables 10.13 and 10.17. 

EXAMPLE 10.4 — Here are four groups of data for easy practice: 

Group 1: 6, 5, 5, 2, 1, 5, 4, 2, 4, 6 
Group 2: 7, 9, 6, 8, 4, 7, 11, 8, 7, 3 
Group 3: 6, 12, 9, 13, 10, 11, 10, 8, 11, 10 
Group 4: 7, 4, 5, 3, 2, 0, 9, 7, 6, 7 

Test the significance of the differences among the group means. Ans. F = 14. 

EXAMPLE 10.5 — The percentage of clean wool in 7 bags was estimated by taking 
3 batches at random from each bag. The percentages of clean wool in the batches were 
as follows: 


Batch 

Bag Number 

1 

2 

3 

4 

5 

6 

7 

1 j 

41.8 

33 0 

38 5 

43 7 

34 2 

32 6 

36 2 

2 

38 9 

37 5 

35 9 

38 9 

38 6 

38.4 

33 4 

3 

36 1 

33.1 

33 9 

36 3 

40 2 

34 8 

37.9 


Show that F = 1.35. 


EXAMPLE 10.6 — In the foregoing wool data, are you willing to accept the hypothe- 
sis that the 21 batches are randomly drawn from a single, normally distributed popu- 
lation? If so, the mean of the entire sample is 36.85% with variance (20 d /.), 9.086, 
figures easily verified from your calculations. Assuming all bags of the same weight, 
set 95% fiducial limits to the mean percentage of clean wool in the 7 bags. Ans. 
35.48%-38.22%. (See example 10 12.) 

EXAMPLE 10.7 — In table 10 4, p stands for any group, being successively 1, 
2, . . . n A good exercise in algebra is proving the formula, 

s{x - xy - ks(x v - ty 4 - ss{x p - x v )\ 

which is the addition theorem for sums of squares in analysis of variance. The left 
member of the equation is the sum of squares of deviations of all items from the 
general mean, x. The first term in the right member is the sum of squares among 
class means, x v representing successively the n such means. The last term is the sum 
of the n sums of squares of deviations of class items from the class mean. Whether you 
derive the formula or not, it should be carefully studied. 


10.5 — Analysis of variance in 2 subsamples. Chapter 4 is a special 
case of the present one, so it is informative to analyze the variance of 
the two lots of chicks studied in that chapter. As has been noted, the 
pooling of the subsample sums of squares is a common feature of the two 
chapters. For the chick data of tables 4.1 and 4.3, this pooled sum of 
squares, 16,220, with d.f. = 20 is entered in table 10.8. We shall complete 
the analysis of variance by computing the sum of squares for the two 
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TABLE 10.8 

Analysis op Variance of Chick Comb Experiment, Table 4.1. Two Subsamples 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Chicks 

20 

16,220 

811 

Lot means 

1 

9,245.5 

9,245.5 


F * 9,245.5/811 - 11.4. y/F - 3.38 - t 


subsample means, 97 and 56 mg. It is perhaps clearest to calculate 
first the two sums, 97 X 11 == 1,067 and 56 X 11 = 616. Then, the 
sum of squares is, as usual, 

(1,067) 2 + (616) 2 _ (1,683) 2 ^ 

11 22 


This is entered in the table and the analysis then completed. F is greater 
than its 1% point, 8.10, just as t in table 4.3 was beyond its. 

Here are two points of interest, (i) If there is only a single degree of 
freedom for subsample means, then y/F = t. The probability of F 
greater than 11.4 is identical with that of t greater than 3.38, so it is a 
matter of choice which one uses, (ii) With only two subsamples, the sum 
of squares for their means is easily calculated this way: 


(1,067 - 61 6) 2 

(H)(2) 


9,245.5, 


as before. Merely square the difference of the two subsample sums, then 
divide by the total sample size. Equally, the result may be got thus: 


11(97 - 56) 2 
2 


9,245.5 


I think you will agree that the F test is easier to perform than the t. 


The reader interested in a short course may now go to section 10.8 and thence to chapter 77, 


EXAMPLE 10.8 — Hansberry and Richardson (6) gave the percentages of wormy 
apples on two groups of 12 trees each. The trees of group A, sprayed with lead arsenate, 
had 19, 26, 22, 13, 26, 25, 38, 40, 36, 12, 16, and 8 percent of apples wormy. Those of 
group B, sprayed with calcium arsenate and buffer materials, had 36, 42, 20, 43, 47, 
49, 59, 37, 28, 49, 31, and 39 percent wormy. Compute the mean square within groups , 
111.41, with 22 dj.\ and that between group means , 1650.04, with 1 d.f. Then 

F « 1,650.04/111.41 - 14.8 

Now, test the significance of the difference between the group means in the manner 
of table 4.3. The value of t is 3.85 — \/14.8. 
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10.6 — Estimates of variance. The striking difference between the pig 
experiment of table 10.2 and the doughnuts of 10.3 is this: in the first, 
the mean squares for lot means and for individuals both estimate the same 
variance, <r 2 ; but in the second, the fat means exhibit a significantly larger 
variance than that of the mixes. Something has been added. This suggests 
a different attitude toward the two experiments. In the first, the means, 
all drawn from the same population, had only the variability due to 
random sampling from the common source; whereas, the fat means had 
an additional component attributed to differences among population 
means. 

It is easy to estimate this added component, c r m 2 , if it is assumed to be 
uncorrclated with <r 2 , because in that case the variance of group means 
is simply the sum of two variances. One is er m 2 , that of the fat population 
means; while the other is a 2 / k, the variance of the mean of the subsamplc 
containing k mixes. Thus, 


Variance of subsample means = <r m 2 



Now <t 2 , assumed to be the same in all the populations, is estimated by 
the mean square for individuals, r = 141.6 in the doughnut experiment 
(As usual, s represents the sample estimate of the population a.) Further- 
more, if you remember (section 10 2) that the variance of the subsample 
means is (mean square)//:, then we may write for the doughnuts, 

503.9 o 141.6 

= S m 2 ■ j — 


Solving this equation, 


503.9 - 141.6 



60.4 


Thus, for estimating a m 2 , we have the easily remembered rule: Subtract 
the mean square for individuals from the mean square for subsamples , then divide 
by the number of individuals per subsample. 

Don’t get the idea that you can isolate s m 2 experimentally. The only 
way to evaluate a mean is through the sum of the individual measure- 
ments, and individual variation sticks to these measurements inseparably. 

It is not hard to illustrate these concepts by sampling. One popula- 
tion can be set up to represent the subsamplc means with variance tr*, 2 
and another to represent individuals with variance c r 2 ; then samples can 
be drawn from each and combined in any desired proportion. In table 
10.9 is one such drawing. The gains representing the lot components are 
from table 3.11 in which or m 2 = 25: there is one of these for each lot. 
Individual pig gains are taken from table 3.1 where a 2 = 100. The surn 
represents the experimental gain in weight set down in column 4 and 
analyzed in the lower part of the table. 
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TABLE 10.9 

Gains in Weight of 20 Pigs in 10 Lots of 2 Pigs Each 
Each gain is the sum of two components. The component for lots is a sample from 
table 3.11 with <r 2 ■= 25, while that for individuals as from table 3.1 with <A — 100 



Lot (subsample) 

Individual 



Lot Number 

Component 

Component 

Pig Gains 

Lot Gains 

a) 

(2) 

(3) 

(4)-(2) + (3) 

(5) 

1 

29 

37 

66 




39 

68 

134 

2 

32 

26 

58 




7 

39 

97 

3 

29 

30 

59 




49 

78 

137 

4 

30 

32 

62 




32 

62 

124 

5 

26 

33 

59 




42 

68 

127 

6 

20 

39 

59 




33 

53 

112 

7 

40 

35 

75 




26 

66 

141 

8 

32 

11 

43 




20 

52 

95 

9 

34 

26 

60 




48 

82 

142 

10 

28 

45 

73 




24 

52 

125 

Analysis of Variance 


Source of Variation 

Degrees of Freedom 

Mean Square 

Lot means 

9 

144 6 

Individuals 

10 

96 5 


j 2 = 96.5 j m 2 = (144 6 - 96.5J/2 - 24.0 


In the last line of the table is the verification sought. The individual 
mean square is an estimate of a 2 = 100. The variance of lot means (the 
lot mean square divided by £), that is, 144.6/2 == 72.3, is an estimate of 

dm + <r 2 /k = 25 + 100/2 = 75 
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Finally, <r m 2 = 25 is estimated by 

144.6 - 96.5 

2 


24.0 


In this way, from the analysis of variance we have recovered estimates of 
the two components that we put into the pig gains. There is here some 
sampling variation, of course, but not as much as usually may be expected: 
this example was chosen because of its accurate estimates. An idea of 
the ordinary variation can be got from examination of the records of 25 
similar samples in table 10.10. 


TABLE 10.10 

Estimates of cr m * « 25 and a 3 =* 100 Made From 25 Samples Drawn Like 
That of Table 10.9 


Sample Estimate of Estimate of 
Number <r m s ® 25 <r 2 = 100 


1 

60 

127 

2 

56 

104 

3 

28 

97 

4 

6 

91 

5 

18 

60 

6 

- 5 

91 

7 

7 

53 

8 

- 1 

87 

9 

0 

66 

10 

-78 

210 

11 

14 

148 

12 

7 

162 

13 

68 

76 


Sample 

Number 

Estimate of 
* m 2 - 25 

Estimate of 
<r 2 - 100 

14 

56 

112 

15 

-33 

159 

16 

67 

54 

17 

-18 

90 

18 

33 

65 

19 

-21 

127 

20 

-48 

126 

21 

4 

43 

22 

3 

145 

23 

49 

142 

24 

75 

23 

25 

77 

106 

Mean 

17 0 

102 6 


One is struck immediately by the great variability in the estimates 
of <7 m 2 , some of them even being negative. These latter merely indicate 
that the mean square for lots is less than that for individuals: the lots 
vary less than random samples ordinarily do if drawn from a single, 
normal population. Clearly, one cannot hope for accurate estimates of 
<r 2 and <r m 2 from such small subsamples. 

EXAMPLE 10.9 — Among the 25 samples reported in table 10.10, one of the more 
divergent had this analysis of variance: 

Lot means 9 67.7 

Individuals 10 53.2 

Show that <r m * is estimated by 7.2. 

EXAMPLE 10.10 — Another sample resulted in this analysis: 

Lot means 9 53 5 

Individuals 10 210 4 

If you are trained mathematically, you will probably be surprised by the estimate, 
s m * = —78.4. Sampling variation has caused an exaggerated estimate of <r* « 100 
and an underestimate of <r 3 4- 2<r m 2 * 150. If you set up the hypothesis that <r m * = 0, 
you would have little evidence against it in this sample. For another viewpoint, see 
section 10.11. 
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EXAMPLE 10.11 — Show that in the sampling of table 10,10, the value of F based 
on population parameters is 1.50. In the 25 samples, mean F == 1.67. 

EXAMPLE 10.12 — In the wool data of example 10.5, s 2 — 8.22 and s 3 -f 3 s m * =» 
11.11, Calculate s * 2 ■* 0.96. If you think that the hypothesis, cr m a 0 should be 
rejected despite the small value of F, and if you suspect that the bags were not all 
equal in weight, you will find that appropriate methods of estimation are discussed 
in chapter 17. 

10.7 — Estimates of variance and the F test. The foregoing section 
adds to one’s understanding of the F test of significance. From what has 
been said it is plain that the mean square for subsamples is an estimate of 


It may be concluded, then, that in the populations sampled the variance 
ratio is 


+ cr, 


•- 


cr 2 + k<r n 


subsample mean square a 2 + ka m 2 

individual mean square cr 

Hence, the hypothesis which was set up in section 10.4 implies that 
cr m 2 — 0 and that F, therefore, is an estimate of 1. In verification, the 
mean of the F's in table 10.6 was 0.96, and these F’s were samples from a 
population having the specifications of the present hypothesis. 

Contrast with this the doughnut experiment. Using the sample 
estimates calculated in the foregoing section, 

n HI 6 + 6(60.4) „ ^ 

F = = 3.56 

141.6 

The additional thing now clear is the manner in which the large F is 
caused by the component of variance, a m 2 = 60.4, associated with the fat 
means. In the ordinary run of samples, rejection of the null hypothesis 
results from the added term, £<r m 2 , in the numerator of the variance ratio. 

In table 10.11 is a summary of the estimates and of the test of signifi- 
cance provided by the analysis of variance. 


TABLE 10.11 

Estimates and Test of Significance in Analysis or Variance. Subclass 
Numbers Equal 


Source of 

V ariation 

Degrees of 
Freedom 

If <r m 2 « 0, 
Mean Square 
Is Estimate of 

If 0 

Mean Square 
Is Estimate of 

If ^ 0 
Variance 

Is Estimate of 

Subsample means 

n - 1 

cr 2 

a 2 + k<r m 2 

«•*/* + j.’ 

Individuals 

n(k - 1) 

cr 2 

<T 2 

o- 2 

Total 

nk - 1 

a 2 



F is estimate of 


1 

<T 2 + k<*rn 

<J 2 
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10.8— Subsamples with different numbers of individuals. Some- 
times it is not possible to provide subsamples of equal size, and at other 
times, not desirable. As an example of the latter case, suppose one is 
investigating the butterfat content of the milk from cows of various breeds 
supplying the creameries of some district. He might wish to conduct the 
investigation so that the average butterfat yield per cow when multiplied 
by the total number of producing animals in the district would be an 
estimate of the total fat production. In that event, it would be necessary 
to sample the breeds by taking from each a number of cows proportional 
to the district population of that breed. Otherwise, the representatives of 
some rare, low yielding breed would inordinately depress the average. It 
is just as important to choose samples representative of the population 
examined as it is to make quite random the taking of the ultimate experi- 
mental units. The two procedures are not incompatible, both being in- 
tegral parts of good sampling designs. Incidentally, the criterion of repre- 
sentativeness makes it imperative that the investigator confine his interpre- 
tations to the sampled population, not broadcasting them over populations 
in general. 

Another source of unequal-sized subsamples is that illustrated in table 
10.12. Much experimental material has such natural groupings. Eight 
sows bore these litters, so that differences among the mean birth weights 
may be attributed partly to sow individualities and partly to litter size 

So far as method of computation is concerned, only a single minoi 
alteration is required to adapt it to the inequality in subsample sizes. 
This is in connection with the sum of squares between means of litters. 


TABLE 10.12 

Birth Weights (Pounds) of Poland China Pigs in Eight Litters 
Spring, 1919 










Litter 










Symbol 

: 

l 


> 


> 

i 

1 

c 

) 

( 

3 


1 

8 

Total 

X 

2 

0 

3 

5 

3 

3 

3 

2 

2 

6 

3 

1 

2 

6 

2 

5 




2 

8 

2 

8 

3 

6 

3 

3 

2 

6 

2 

9 

2 

2 

2 

4 




3 

3 

3 

2 

2 

6 

3 

2 

2 

9 

3 

1 

2 

2 

3 

0 




3 

2 

3 

5 

3 

1 

2 

9 

2 

0 

2 

5 

2 

5 

1 

5 




4 

4 

2 

3 

3 

2 

3 

3 

2 

0 



1 

2 






3 

6 

2 

4 

3 

3 

2 

5 

2 

1 



1 

2 






1 

9 

2 

0 

2 

9 

2 

6 












3 

3 

1 

6 

3 

4 

2 

8 












2 

8 



3 

2 














1 

1 



3 

2 













sx 

28 

4 

21 

3 

31 

8 

23 

8 

14 

2 

11 

6 

11 

9 

9 

4 

152 

4 

k 

10 


8 


10 


8 


6 


4 


6 


4 


56 


£ 

2 

84 

2 

66 

3 

18 

2 

98 

2 

37 

2 

90 

1 

98 

2 

35 



SX* 

88 

84 

60 

19 

101 

80 

71 

52 

34 

34 

33. 

.88 

25 

57 

23 

26 

439. 

.40 
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Since k changes from one litter to another, the eight quantities, (£A r ) 2 , 
must each be divided by its own k in order to reduce it to a per-item basis. 
That is, (28.4)710 + (21.3)78 + . . . + (9.4)74 - (152.4)756 = 7.48 is 
the computational procedure which replaces the fourth step of section 
10.3. The corresponding formula, replacing step 4 in table 10.4, is 

SiVh + Sf/k* + . . . + S n */kn - C 

Thejremaining results of calculation from table^lO.lO are easily verified: 

Correction7crm = (152.4)756 = 414.75 

Totaf[s\m\ m of squares = 439.40 — 414.75 = 24.65 

The total sum of squares and that between litters, entered in table 10.13, 
are subtracted to get the sum of squares for pigs of the same litter. 


TABLE 10.13 

Analysis of Variance of Birth Weights of 56 Pigs in 8 Litters of Various Sizes 



Degrees of 

Sum of 

Mean 

Source of Variation 

Fi ecdom 

Squares 

Square 

Total 

55 

24 65 


Litter means 

7 

7 48 

1 07** 

Pigs of same litter 

48 

17 17 

0 36 


The degrees of freedom may be got by subtraction, also; or they may be 
thought of as the sum of those from the eight litters, 

9 + 7 + 9 + 7 + S + 3 + 5 + 3 = 48 

This way of thinking about degrees of freedom is valuable because it 
brings to mind the fact that the associated sum of squares, 17.17, is itself 
the sum of eight litter sums of squares, each of which may be independently 
computed. 

The two mean squares, for litters, 1.07, and for pigs, 0.36, are notably 
different. Since F = 1.07/0.36 = 2.97 is just short of the 1% point, 3.04, 
it is not likely that the litter birth weights constitute random samples from 
a homogeneous population. As has been indicated, there are two sources 
of variability in the litter means. First, the dams may differ in their ability 
to nourish the fetuses, resulting in a variation of mean birth weights even 
when other factors are identical. Second, the size of the litter may affect 
the pig weights. Since the segregation of the sums of squares attributable 
to these two sources of variation involves an extension of the foregoing 
statistical methods, it will be postponed till the following sections. At 
present it is only necessary to note the fact that there are real litter differ- 
ences to be examined and, if possible, explained. Sec, also, section 10.13. 
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The symbolical summary of computations for varying numbers of 
items in the groups is set out in table 10.14. One new feature appears. 


TABLE 10.14 

Symbolical Summary of Computations, With Varying Numbers 
of Items in the Subsamples 


The Data 


Group 1 

Group 2 . . 

, , Group p . . 

. Group n 

Xx 



x n 

ki items 

k 2 items 

k p items 

k n items 


Si S t ... S v ... Sn 


The Computations 


1. The sum: S ^ Si 4* S 2 4“ * • • *■}"*$» 

2. The correction term- C = S 2 /Sk 

3. The total sum of squares: SX* — C 

4. Subsample means: S?/k i + S n */k n — C 

5. Individuals: total — subsample means 

6. Average k: k a - — ■!— (Sk - 21) 

n — 1 dk 


Analysis and Estimates of Variance 


Source of 
Variation 

Degrees of 
Freedom 

If <r m 2 - 0 

Mean Square 

Is Estimate of 

If ( r m 2 ^ 0 
Mean Square 

Is Estimate of 

If On? 5 1* 0 
Variance 

Is Estimate of 

Subsample means 
Individuals 

n — 1 

Sk ~ n 

cr 2 

a 2 

o' 2 4- k 0 <r m 2 

cr 2 

O i fk 0 4- Or 2 
o 2 

Total 

Sk - 1 

cr 2 



F is an estimate of 


1 

o' 2 T k 0 a m 2 

a 2 



Since the subsamples have varying k' s, an average number, k oy is needed 
for estimating variance. This is, 



For the pig data, Sk is the total sample size, 56 animals. Sk 2 = 10 2 + 10 2 
+ 8 2 + . . . + 4 2 = 432. Then 



This average is always smaller than the mean of the £’s. 
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<r ro 2 is now estimated by 


1.07 - 0.36 
6.90 


0.103 


EXAMPLE 10.13 — Here is an easy example for practice: 

Group 1: 18, 11, 16, 12, 14, 17, 18, 20 
Group 2: 19 
Group 3: 16, 18, 26 

Group 4: 19, 15, 15, 10, 24, 17, 13, 17, 17, 13, 16 
The total sum of squares, 307.65 with 22 d /., is divided into 50.15 for group means, 
dj. *= 3, and 257.50 for individuals. F « 1.23. 


EXAMPLE 10.14 — Applying analysis of variance to the two lots of rat gains in 
section 4.5, show that y/F ' =■ t — 1.89. What portions of table 4.5 are used in the 
present computations? 


EXAMPLE 10.15 — Jenkins (8) and Snedccor (14) compared the yields of a number 
of varieties of corn, each variety being represented by several inbred lines. Six varieties, 
with yields (bushels per acre) of their inbred lines, are as follows: 

1. Four County: 7.3, 4.5, 7.4, 7.4, 5.0, 5.9, 6.4, 6.3, 5.0, 6.1, 7.9, 5.7 

2. Silver King: 7.7, 5.4, 5 2, 4.0 

3. Iodent: 6.9, 6.8, 7.6, 8.1, 9.4, 12.0, 15.9, 7.4, 9.0, 5.2, 9.2, 8.6 

4. Lancaster: 9.6, 7.8, 9.6, 7.7, 8.2, 7.3, 11.3, 9.5, 8.8, 8.4, 6.8 

5. Osterland: 4.8, 9.2, 8.5, 8.8, 7.9, 5.9, 9.2 

6 Clark: 4.3, 8.4, 6.6, 4.9, 5 8, 7.6, 3.7 

Calculate F = 17.55/3.38 = 5.19, dj . = 5 and 47. Estimate the component of vari- 
ance associated with the varieties of the preceding example. Ans. k 0 = 8.63, s m 2 *= 1 64. 


EXAMPLE 10.1 6 — Among 224 mice inoculated with 3 strains of typhoid organisms, 
the days of survival are summarized in the following frequency distributions. In 
applying the method of section 8 2 it is perhaps easier not to code the days-to-death, X> 
but to use the recorded data. The column of totals is necessary only for verification. 
Calculate F = 179.9/5.78 - 31.1, dj . = 2 and 221. 


Days to Death 

Numbers of Mice Inoculated 

With Indicated Strain 

9D 11C DSC1 

Total 

2 

6 

1 

3 

10 

3 

4 

3 

5 

12 

4 

9 

3 

5 

17 

5 

8 

6 

8 

22 

6 

3 

6 

19 

28 

7 

1 

14 

23 

38 

8 


11 

22 

33 

9 


4 

14 

18 

10 


6 

14 

20 

11 


2 

7 

9 

12 


3 

8 

11 

13 


1 

4 

5 

14 



1 

1 

Total 

31 

60 

133 

224 

SX 

SX* 

125 

561 

442 

3,602 

1,037 

8,961 

1,604 

13,124 
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EXAMPLE 10,17 — If k v represents the number of items in the />th class, X v the 
observed values in that class, and x v the mean, then the addition theorem for sums of 
squares is given by 

S(X - x )* - Sk p {x p - £)* + SS(X V - f p )* 

Since the values of k v are not the same for all classes, each ( x v — x )* must be multi- 
plied by its own k v . Contrast this formula with that of example 10.7. Can you prove 
the theorem? You have already verified it several times by arithmetic processes. 

10.9 — Analysis of variance of subsample means. Equal subsample 
numbers. Repeatedly, attention has been called to the fact that when 
lots of individuals are kept in separate compartments, then if different 
treatments are applied to the lots there is no way to learn whether result- 
ing differences are due to the treatments or to other causes. The experi- 
ment summarized in table 10.15 illustrates one of the many designs that 
furnish information about any extraneous environmental differences that 
may affect the subsamples. Here, each treatment is applied to the hens 


TABLE 10.15 

Eggs Laid by Each of 1 20 Hens. Six Treatments, Each Applied to 2 Lots 
of 10 Hens. Each Lot in Separate Cage 


Treatment | 

1 

2 1 

3 

4 

5 


6 

Lot 

i 

2 

1 1 

2 

1 

2 

! i 

2 

1 

2 

1 

2 

1 

100 

70 

1 94 

90 

84 

88 

82 

78 

87 

87 

70 

64 

2 

51 

90 

86 

88 

52 

86 

72 

86 

66 

79 

55 

56 

3 

39 

102 

52 

78 i 

103 

77 

72 

93 

50 

41 

71 

26 

4 

56 

81 

! 81 

102 ! 

61 

66 

77 

87 

92 

82 

74 

58 

5 

! 56 

83 

60 

45 

63 

91 

67 

63 

96 

62 

64 

61 

6 

! 77 

98 

84 

90 

84 

45 

61 

72 

67 

83 

51 

59 

7 

75 

85 

81 

58 1 

78 

83 

29 

50 

85 

100 

42 

53 

8 

73 

90 

! 97 

70 

51 

106 

36 

81 

80 

51 

86 

72 

9 

65 

90 

! 81 

65 

76 

88 

58 

78 

76 

85 

79 

48 

10 

78 

22 

78 

11 4 

32 

63 

64 

60 

86 

78 

68 

25 

Total 

670 

811 

794 

800 

684 

793 j 

618 

748 

785 

748 

660 

522 

Lot means 

67 

0 81 1 

79 

4 80 0 

68 

4 79 3 

61 

8 74 8 

78 

5 74 8 

66 

0 52 2 

Treatment 













Sums 

1,481 

1,594 

1, 

,477 

1,366 ! 

1 1,533 

1, 

,182 

Means 


74 0 


79 7 


73 8 


68 3 


76 6 


59 1 


Preliminary Analysis of Variance 


Source of Variation 

Degrees of Freedom 

Sum of Squares 

Mean Square 

Total 

119 

40,992 6 


Cages 

11 

8,827 9 


Individuals 

108 

32,164 7 

297.8 
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TABLE 10.15 -(Continued) 

Analysis of variance of Cage Means 


Cages 

Treatments 

Cages treated alike 


11 8,827 9 

5 5,3723 1,0745 


6 3,455 6 575.9 


Completed Analysis of Variance 


Treatment 

Cages treated alike 

Individuals 


5 5,372 3 1,074 5 

6 3,455 6 575 9 

108 32 , 164.7 297.8 


Total 


119 40,992 6 


in two cages. If these identically treated cages arc randomly chosen, 
any differences among the numbers of eggs laid in them, exceeding normal 
sampling vaiiation, are attributable to differences in environment, such 
as light, heat and air circulation. In the present experiment, these differ- 
ences, if any, are included in the experimental error. Later we shall 
explain more efficient designs that will tend to exclude them. 

In the preliminary analysis, the total sum of squares is separated, in 
the usual fashion, into parts due to cage and individual variation. But it 
is clear that cage differences should be distinguished. One source is 
treatments, another is those unexplained causes that may produce differ- 
ences in egg production (in excess of random fluctuation) between cages 
treated alike. The sum of squares for the former is got from the treatment 
sums by the familiar procedure, 

(1,481)=+ (1,594)= + ... + (1,182)= (8,633)= E , 

— a ,a/z.3, 

20 120 


the correction for mean being the same as that used in the preliminary 
analysis. This result leads to the analysis of variance of cage means 
in the table, and thence to the completed analysis. 

Note on computation: The method just outlined is the customary one, 
the sum of squares for the overall classification, cages, being calculated 
first; that for the major classification, treatments, second; and finally 
the remainder, corresponding to cages treated alike. But the latter sum 
of squares can be calculated independently. Since there are only two 
subsamples per treatment, the method of section 10.5 is applicable. Thus, 
for the first treatment, the single degree of freedom for cage means has 
the sum of squares (or mean square), 


(811 - 670) 2 


994.05 


2 ( 10 ) 
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The other 5 dj \ have the sums of squares: 1.80, 594.05, 845.00, 68.45, 
and 952.20, the sum being 3,455.55, rounded to 3,455.6 in the table. This 
computation has three merits: it verifies the tabular results, it reflects 
the striking contrasts among the differences between means of cages re- 
ceiving the same treatment, and it emphasizes the theoretical fact that 
each degree of freedom has a sum of squares all its own. Heterogeneity 
of error variance will be discussed in section 10.13, while the third point 
is the subject of chapter 15. 

The completed analysis of variance increases the suspicion that 
different environments of the cages treated alike have introduced a com- 
ponent of variance into the mean square for cages. In addition another 
component may have been contributed by treatments. As a guide to 

TABLE 10.16 

Estimates of Components of Variance With Major and Minor Classifications 
of Subsample Means. Ego Laying Data 


Source of 
Variation 

Mean 

Square 

Individuals 
per Subclass 

Mean Square Is 
an Estimate of 

Variance Is an 
Estimate of 

Treatment 

1,074 5 

20 

cr 2 + 10^+20(rr 2 

<tV20W/2-W 

Cage 

575 9 

10 

a 2 + 10 of 

<r 2 /10-f<r c 2 

Individual 

297 8 

1 

cr 2 

<7* 

s * « 297.8, s c ' - 

(575.9 - 

297.8)/10 - 27. £ 

!, V - (1,074.5 - 

575.9J/20 = 24 9 


appropriate tests of significance, it is convenient to extend the estimates of 
variance (section 10.6) to include the newly made analysis of subsample 
means. This is done in tabic 10.16. The new feature is the addition of a 
third component of variance by the major classification, treatment. All 
parts of the tabic will be clear upon inspection. 

It is now evident that the cage component of variance, s c 2 = 27.8, 
is almost as great as the variance of the cage mean, s 2 / 10 = 29.78. 
Clearly, the variation of the individual birds is only part of the experi- 
mental error. Apparently the hens were influenced in their production 
of eggs not only by the treatments imposed but also by some extraneous 
conditions in the cages. This means that even in a uniformity trial with 
no real treatments there would be two sources of variation in mean cage 
production, the normal variation of the birds and the environments 
peculiar to the several cages. Both these sources enter the variance at- 
tributable to treatments and experimental error as well. 

There are two null hypotheses whose testing is indicated in table 10.16. 
The one of prime interest is a T 2 = 0, tested by the ratio, 

<7 2 + 10<T C 2 +20(7 r 2 
cr 2 + 10oc 2 
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This is estimated by F = 1,074.5/575.9 = 1,87, with df. = 5 and 6. 
Since F os = 4.39, there is little evidence of treatment effects. 

The experimenter probably would be interested in testing also the 
hypothesis, cr c 2 = 0. For this, the variance ratio, 

cr 2 + 1 Oar 2 

7 5 

is estimated by F = 575.9/297.8 = 1.93, d.f. — 6, and 108, F M = 2.19. 
He might reason this way: “While F is not significant, I have noticed that 
environmental effects usually have appeared in the past. I think <r c 2 
really exists, and I shall re-design both the battery and my experiment in 
order to avoid such effects in the future.” Or, his reaction might be: 
“The laying battery has been used successfully many times, and this is the 
first indication of cr c 2 that has been noticed. 1 think it is merely a freak 
of sampling and that a c 2 is really zero. (Remember the sampling variation 
of table 10.10.) Of course, I know that something unusual may have oc- 
curred this time, like an undetected disease in several of the cages, and 
that the experimental treatments may not have caused the differences, 
but nevertheless I am willing to accept the hypothesis.” 

If the investigator’s experience and good judgment dictate this hy- 
pothesis, then table 10.16 may be revised. Replacing a c 2 by zero, you will 
see that the sums of squares for cages and individuals are estimates of the 
same variance, a 2 , and may be combined into a more reliable estimate. 
This is done by pooling the degrees of freedom (table 10.15), 6 + 108 
= 114, and the sums of squares, 3,455.6 + 32,164.7 = 35,620.3, then 
calculating the mean square, 35,620.3/114 = 312.5. The new analysis 
is in table 10.17. Thus if the experimenter’s decision is correct, treatment 


TABLE 10 17 

Anaiysis of Variance of Egg Laying Dai a Under the Hypothesis, arc 3 — 0 




Degrees of 

Sum of 

Mean 

Source of Variation 

i 

: 

Freedom 

Sq uares 

Square 

Treatment 

1 

5 

5,372 3 

1,074 5 

Individuals 


114 

35,620 3 

312 5 


F = 1,074 5/312.5 = 

3 44** 



effects are significant. To see clearly one thing that is implied, observe 
that there are now 6 X 19 = 114 d.f . for individuals: the 20 birds in each 
pair of cages treated alike are considered as a single lot. If there is any 
extra environmental difference between the pairs of cages it has been 
included in the estimate of cr 2 , increasing it from 297.8 to 312.5. But there 
is a second implication: the effects of any extraneous differences among 
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the cages treated differently are now considered as part of the treatment 
effects. I suspect that, under these circumstances, most investigators 
would hesitate in making decisions about treatments based on the auxil- 
iary hypothesis, <t c z = 0. Instead, they would probably set about the 
reconstruction of the laying battery, adapting it to a more effective 
experimental design. 

What is this more effective design? Not one but several will be pre- 
sented later. At the student’s present stage of advancement he will 
appreciate one suggestion: use more cages with fewer birds per cage. 
This will allow more replications of each treatment. Further details will 
be found in example 10.26. 

10.10 — Analysis of variance of subsample means. Unequal sub- 
sample numbers. It was by design that the cages of laying hens could be 
grouped according to treatment, but the pig litters of section 10.8 have a 
natural grouping by size. The 7 d.f. for litter means may be divided into 3 
corresponding to size of litter, and 4 between litters of the same size. The 
only change in calculation is the usual one for unequal subclass sizes: 
division of the squared sum is by the corresponding k. Since the sums of 
the pig weights in the 4 litter-size groups, easily computed from table 
10.12, are 60.2, 45.1, 26.1, and 21.0 pounds, the sum of squares is 
(60.2) 2 /20 + (45.1)2/16 + (26.1) 2 /12 + (21.0)2/8 - (152.4) 2 /56 - 5.47 
The remainder, 7.48 — 5.47 = 2.01, is attributable to the variation of 
mean birth weight between pairs of litters of the same size. The com- 
pleted analysis is in table 10.18. 

TABLE 10.18 

Completed Analysis of Variance of the Pig Data of Tables 10.12 and 10 13 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Size of litters 

3 

5 47 

1 82 

Litters of same size 

4 

2 01 

0 50 

Individuals 

48 

17 17 

0 36 


Estimates of Variance With Major and Minor Classification 
of Subsamplc Means 

Source of 
Variation 

Mean 

Square 

Individuals Mean Square Is 

per Subclass an Estimate of 

Variance Is 
an Estimate of 

Litter sizes 

1 82 

13 52 cr 2 +6.90o- L 2 -fl3.52(r 5 2 

erVl 3.52 +«r L yi.96 

Litters 

0.50 

6 90 (7 2 -f6.90(r L 2 

<r 2 /6.90-f-<r2, 2 

Individuals 

0.36 

1 <r 2 



x 2 - 0 36, j^O 020, s 8 2 *0.098 
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For the estimates of variance in table 10.18 two averages, k 0 , are 
required. The first, for litters, is the same as that calculated in section 
10.8, k 0 = 6.90. The second, for the size classes, 10, 8, 6, and 4 pigs, has 
n = 4, Sk = 20 + 16 + 12 + 8 = 56 and Sk 2 = (20) 2 + (16) 2 + (12) 2 
+ ( 8)2 = 864. Whence, 

1 / 864 \ 

= 4— 1\ ^ 56 ) ~ 

The variation between litters of the same size derives from two sources, 
differences among the dams producing the litters and the normal sampling 
variation associated with the pig variance, 0.36. The hypothesis that the 
population variance of the former, a L 2 , is zero is tested by 

F = 0.50/0.3 6 = 1.4, d.f. = 4 and 48, F M = 2.56 

So far as the sample is competent, there is little evidence of differential 
ability to produce pigs of distinctive birth weights in litters of the same 
size. 

Variation among pig birth weights in litters of differing sizes involves 
not only both sources evaluated above, but also possible differences in 
available nutrients during the pre-natal period. It might be supposed 
that total birth weight would tend to be constant, with the result that pigs 
in small litters would be heavier than those in large. The hypothesis, 
o\ s 2 = 0, is tested by 

F = 1.82/0.50 = 3.64, d.f . = 3 and 4, F M = 6.59 

The evidence against this hypothesis, then, is slight; but examination of 
the litter-size means, easily calculated from table 10.12 as 3.01, 2.82, 2.18, 
and 2 62 pounds, reveals a trend toward lighter pigs in the smaller litters, 
and this doesn’t seem reasonable. Apparently, the data include too few 
litters for decisions about these variable characteristics of birth weight. 

EXAMPLE 10 18 — In example 6 21 we made two estimates of the population 
variance of soybean heights. If you refer to the original table, 6.1, you can make the 
following analysis of variance: 

Weeks 4 3,120 0 

Individuals 15 130 0 8.67 

The 4 dj. for weekly means may now be subdivided in this fashion: 

Attributable to regression 1 3,097 6 

Deviations from regression 3 22 4 7.47 

These latter sums of squares may be taken from table 6.3 if it is remembered that the 
variance of individuals is k times that of the means in that table. Thus, the sum of 
squares for regression is 4 times 

{Sxy)*/Sx* = (88) a /10 - 774 4, 

while that for deviations is 4(780 — 774 4) = 22.4. The two estimates of o-*, 8.67 
and 7.47, are the same as those derived in example 6.21. 

In the present example, one feature of the experimental design has been ignored — 
the arrangement of the pots in 4 blocks. This will be remedied in the next chapter. 
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EXAMPLE 10.19 — The following data were abstracted from records of perform- 
ance of Poland China swine in a single inbred line at the Iowa Agricultural Experi- 
ment Station. Four pigs were taken from each of 23 litters and fed a standard ration 
from weaning to about 225 pounds. In each litter there were chosen 2 boars and 2 
sows. Every entry in the table is the sum of the average daily gains (pounds per day) 
of 2 pigs of the same sex: 


Sex 

1932 

1933 

Sire 1 Sire 2 Sire 3 Sire 4 

Sire 5 Sire 6 Sire 7 

Boars 

2 768 2 550 2 284 2 357 

2 869 2 743 2 878 

Sows 

2 375 2 574 2 223 2 710 

2 460 2 557 2 622 

Boars 

2 620 3 009 2 716 j 

2 309 2 496 3 055 

Sows 

2 617 2.609 2 742 ! 

i 

2 237 2.483 2 800 

Boars 

2 563 2 776 ! 

2 959 2 777 

Sows 

2 231 2.476 

i 

2 893 2 706 

Boars 

2.575 1 964 

2 918 

Sows 

2 939 2 294 

2 674 

Boars 

2.807 


Sows 

2 203 


Boars 

2.789 


Sows 

2 996 

; 

Boars 

2 591 


Sows 

2 520 



The analysis of variance taken from the original data is: 


Litters 

22 

0 

987070 

Pigs 

69 

1 

109369 

Total 

91 

2 

096439 


Verify the sum of squares for litters by use of litter sums calculated from the table. 


EXAMPLE 10.20 — Subdivide the sum 

of squares 

and degrees of freedom for 

litters as follows: 

Years (F) 

1 

0 083986 

Sires within years (B) 

5 

0.169271 

Litters with common sire (L) 

16 

0.733813 

EXAMPLE 10 21— Subdivide the sum 

of squares 

and degrees of freedom for 

pigs thus: 

Sex within litters (S) 

23 

0 426272 

Pigs of same sex and litter 

46 

0.683097 

EXAMPLE 10.22 — From the foregoing analyses, 

estimate the components of 

variance associated with 


Years k 0 « 43 826 

Jy 2 « 

0 001144 

Sires 1 2 464 

*B* =■ * 

-0 000963 

Litters 4 

s L 2 = 

0 006832 

Sex 2 

ss 2 * 

0 001842 

Pigs 1 

r 2 - 

0 014850 


The mean square for sires is less than that for litters, probably as a result of sampling 
variation. It is reasonable to think that there are real differences in rate of gain 
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among litters by different sires, but this sample furnishes no estimate of <ra*. So far as 
this sample is reliable, the best estimate of ora* is zero. 

Geneticists will recognize the fact that s 2 « 0.014850 is the estimate of genetic 
variance least affected by environment. 


10.11 — Intraclass correlation. It has been shown that the variance 
in a sample may be separated into two parts, one reflecting the natural 
variation of individuals treated alike, and the other arising from genetic 
or environmental conditions peculiar to the subsamples. The former is an 
estimate of <r 2 , which may be assumed equal in the sampled populations, 
while the latter is an estimate of a m 2 , the added portion related to differ- 
ences among the population means. The sum, a 1 + a m 2 , is the variance of 
individuals picked at random from the entire universe made up of the 
sampled populations. The ratio of the two variances, 


is known as the intraclass correlation , p 7 . In the numerator is the variance 
which, in a subsample, is common to all individuals because all of them 
are affected alike by the variation which pertains to the subsample as a 
whole. This is cr m 2 = 25 in the illustrative sampling of table 10.9. In the 
denominator is an average variance that would apply to individuals if 
they were picked at random from the universe without attention to popu- 
lation boundaries. 

The justification for calling this “correlation” may be understood by 
reference to section 7.4 where it was shown that r is the ratio of two 
variances. The numerator in both correlations is associated with the 
variance that is common to individuals: the covariance in one case, and 
cr m 2 in the other. The denominator is an average of the variation that 
would affect individuals if they were not classified — into ,Yi and X 2 , or 
into subsarnples. 

If all subsamples are from the same population, as in the pig gains of 
tables 10.1 and 10.6, then cr m 2 = 0 and consequently p 7 = 0: the entire 
variation, both individual and group, arises from the random sampling of a 
single population. Of course, in any particular sample from p 7 = 0 the 
value of the intraclass correlation r 7 , is not usually zero. Thus, in table 
10.5, r 2 - 114.0, s m 2 = (172.3 - 114.0)/10 = 5.83, and 


5.83 

114.0 +~5?83 


= 0.049 


The hypothesis that p r = 0 is identical with the hypothesis, a m 2 = 0, and 


F - 172.3/114.0 - 1.51, 


a nonsignificant value, is the appropriate test of both of them. 

Large intraclass correlation indicates relatively small variation among 
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the individuals of the subsamples. Indeed, in the limiting situation, 
<r 2 = 0 and 


Since this could happen only if all the individuals in each group had 
exactly the same measurement, it is of only theoretical interest. However, 
the data on identical twins in table 10.19 illustrate a striking approach 


TABLE 10.19 

Number of Finger Ridges on Both Hands of Individuals in 12 Pairs 
of Female Identical Twins 
Data from Newman, Freeman and Holzingcr (11) 


Pair 

Finger Ridges 
of Individuals 

Pair 

Finger Ridges 
of Individuals 

Pair 

Finger Ridges 
of Individuals 

1 

71, 71 

5 

76, 70 

9 

114, 113 

2 

79, 82 

6 

83, 82 

10 

94, 91 

3 

105, 99 

7 

114, 113 

11 

75, 83 

4 

115, 114 

8 

57, 44 

12 

76, 72 



Analysis of Variance 



Source of Variation 

Degrees of Freedom 

Mean Square 

Twin pairs 



11 

817 314 

Individuals 


12 


14 292 


x 2 = 14.292, s m * - 401.51, n = 0 966 


to perfect correlation. The totals of finger ridges are much the same for 
the two members of each pair but differ markedly among pairs. Since 
these counts are presumably unaffected by age, the high twin correlation, 
Tj — 0.966, measures the similarity of their inheritance of this character. 

One can get a good idea of the variation of r 7 in small samples by com- 
puting its values from the random samples of pig variance in table 10.10. 
In sample 1, for example, 

rj = 60/(60 + 127) = 0.321 

Among the 25 samples the smallest intraclass correlation is —0.615 in 
number 20, the largest, 0.765 in number 24. This is ordinary sampling 
variation from the population value, 

Pr = 25/(25 + 100) = 0.2 

As in interclass correlation, there is a small bias in the intraclass. This 
may be corrected approximately by transformation to z (figure 7.4) and 



Section 10.11 


Analysis of Variance: Single Classification 


245 


addition of 1 / {In — 1), where n is the number of pairs. Thus, in sample 1 
above, 

r 1 = 0.321 
Z = 0.333 
l/(2» - 1) = 0.053 


Unbiased z = 0.386 
Unbiased r 7 — 0.368 

If k = 2, intraclass correlations may be averaged and may have fiducial 
limits set as in section 7.6 (4, Chapter 7). 

In order to calculate intraclass correlation directly from a table of 
analysis of variance, it is convenient to put the formula in this form: 

- M 

T ' ~ Mi + (k - \)M ’ 

where Mi and M denote mean squares for subsample means and for indi- 
viduals. In the twin data, Mi = 817.314, M = 14.292 and k = 2. Hence 


817.314 - 14.292 
817.314 + (2 - 1 ) (1 4.292) 


= 0.97 (as before) 


If the subsamples differ in size, the average value, k 0 , is used. For 
example, in the pig data of tables 10.12 and 10.13, 

r, = 1 - ° 7 - - Q - 36 = 0.222 

1.07 + (6.90 - 1 ) (0.30) 

The F-test for table 10.13 shows that this small correlation is significant. 

If the subsample means vary less than would be expected from the 
variation of the individuals, then Mi is smaller than M , a m 2 is estimated 
by a negative number and r 7 is negative. But r 7 cannot approach — 1 
unless k = 2. As can be seen from the formula, 


if Mx — 0, r 7 = 


1 


k - 1 


This indicates why the distribution of r 7 is different from that of r so that 
the test of significance in table 7.3 cannot be applied to intraclass correla- 
tion. 

If one attempts to graph the twin data, he will not know, in any one 
pair, which number of finger ridges is X\ and which, X 2 . The solution is 
to locate 2 points for each pair: the second pair in table 10.19, for example, 
determines the points (79, 82) and (82, 79), symmetrically located with 
respect to a 45° line through the origin (45°, that is, if the scales for X\ 
and X 2 are equal). If there were triplets instead of twins, each set of 
observations, a, b , and c, would specify 6 points, (a, b ), ( b , a), (a, c), (c, a ), 
( b , r), and (r, b). The number of points per subsample rises rapidly with k\ 
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12 for k = 4, 20 for k = 5, etc. For large s, the job of computing r 7 
directly from all these pairs was discouraging until Harris (7) discovered a 
shortened process similar to the later developed analysis of variance (5). 

EXAMPLE 10.23 — Calculate the unbiased estimate of the correlation between 
numbers of finger ridges in table 10.19. Ans. 0.969. 

EXAMPLE 10.24 — From the pig-litter analysis in example 10.19 one can calculate 
M « 0.01607 and Mi — 0.04487 Bearing in mind the 4 pigs per litter, compute 
rj =* 0.31. This may be referred to as the intralitter correlation ignoring the effect 
of sex. 


EXAMPLE 10.25 — The more critical analysis of example 10.22 afFords a number 
of intraclass correlations that may be of interest. The intrasex correlation is 

0.001842 _ A11A 

T ‘ “ 0.001842 + 0.014850 0 1 10 

This is small for weight gains in swine, but would be larger among poultry or rats. 
The intralitter correlation is 


sj} __ 0.006832 

j l 2 '+ +V " 0.006832 + 0.001842 + 6.014850 


0.290 


Interest might be centered in the intralitter correlation among sows only, or among 
boars The original data could be segregated by sex, then reanalyzed. But an average 
correlation could be arrived at in this way: 

sl % _ 0 006832 

Sl } + j* " 6 006832 + 0 014850 

Here, the sampling is within sex only, so that the sex component of variance is omitted 
from the denominator. 


10.12 — Efficiency of an experiment. After he completes the exami- 
nation of some experimental data, the investigator may wonder if the 
experiment might have been conducted more efficiently. The most 
efficient experiment yields the maximum amount of information per 
dollar spent. One rather simple problem of this type is illustrated in 
table 10.20, the results of an effort to learn something about the vari- 
ability of the concentration of carbon dioxide in soil air taken from fallow 
plots of Clarion loam (13). Triplicate determinations were made at each 
of 3 positions per plot. From the table it is clear that the chief source 
of variation is positional: the three determinations in the same place are 
much alike. The question to be considered is whether one could have got 
more information about plot differences by spreading the 9 soundings at 
random over each plot. 

In this type of investigation, one is usually not concerned with the 
plot component of variance because it is an accident of the particular 
plots selected and may not be encountered again. Rather, one asks, 
“What is the average variance of plot means based on 9 determinations 
per plot?” The first answer is calculated for the experiment as it was con- 
ducted. The variance of a positional mean was s 2 / 3 + s P 2 = 48.82/3 
= 16.27; hence, the variance of the mean of 3 positions, assuming random 
selection, was 16.27/3 = 5.42. Second, we may estimate the variance 



Section 10.12 Analysis of Variance: Single Classification 


247 


of a plot mean as it might be if the determinations were randomized in 
the plots. The variance of such determinations was shown (section 10.11) 
to be estimated by s 2 + s P 2 = 1.52 + 15.77 = 17.29. The corresponding 
variance of the plot mean is 17.29/9 = 1.92. Remembering that the 

TABLE 10.20 

Concentration of Carbon Dioxide in the Soil Air of Fallow Plots of 

Clarion Loam 
Parts per 10,000 


Previous Treatment of Plot 

Position on Plot i 

Plot Total 

1 2 3 

None 

16 7 12 

14 10 8 

14 8 10 

44 25 30 

99 

1,500 pounds per acre 
alfalfa plowed under 

1 

8 16 6 

6 16 6 

6 16 6 

20 48 18 

86 

1,500 pounds per acre 
oat straw plowed under 

11 8 6 

12 6 6 

12 8 9 

35 22 21 

78 


Total 263 


Analysis of Variance of Concentration of 
Carbon Dioxide in Soil Air 


Source of V ariation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Total 

26 

345 19 


Plots 

2 

24 97 

12 48 

Positions within the plots 

6 

292 89 

48 82* 

Determinations at same positions 

18 

27.33 

1 52 


Components of Variance 


Source of 
Variation 

Mean 

Square 

Number of 
Determinations 

Mean Square Is 
an Estimate of 

Variance Is an 
Estimate of 

Positions 

48.82 

3 

<7 2 -f 3cr/> 2 

a 2 / 3 + ap 2 

Determinations 

1 52 

1 

a 2 

a 2 


^ - 1.52, 

rp 2 - (48.82 - 

1 52)/3 - 15.77 
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information from an experiment has been defined as inversely propor- 
tional to the variance, we reach the conclusion that the information 
could have been increased by 


J 1_ 

1.92 5.42 

i 

5.42 


= 182% 


if the soundings had been scattered randomly over the plots. You may 
object that it would cost more to randomize the determinations, and this 
may be true. The exact calculation of maximum information per unit 
cost is beyond the scope of this text, but we shall consider other features 
of it, and cite references, in chapter 17. At the moment we have learned 
that, in this instance, determinations in triplicate were statistically ineffi- 
cient as compared to the same number of determinations made singly. 
This is because of the small variation in the measurement of concentra- 
tion as compared to the positional variability of the concentration itself. 

Owing to the prevalence of multiple determinations, we should per- 
haps emphasize the method just used. Let us consider three stages that 
are common. First, two or more samples of some material are collected 
at different places. Next, there are randomly taken for analysis two or 
more subsamples from each sample. Finally, two or more readings are 
made on each subsample. The estimates arc like this: 


Samples 

Subsamplcs 

Readings 


cr 2 + k\G + A: oas 2 
* 2 + k\G /j 2 


The readings may be made on a single batch of material that has under- 
gone some more or less elaborate chemical or biological processing: the 
variation among them, like that among the triplicate determinations in 
tabic 10.20, may have only minor sources such as differences in counts of 
bacteria on parts of the same plate. Subsamples, however, would pre- 
sumably be prepared separately. They would differ not only because of 
the usual sampling variation in the material itself, such as from one posi- 
tion to another in table 10.20, but also because of small irregularities in 
the laboratory procedures. Together, these make up o B 2 . This kind of 
variation is illustrated by blood counts made from two different punctures 
of the same individual, the material from the punctures being handled, 
say, by two technicians. Finally, there is the variation among sources of 
the material. These might be truck loads of ear corn, a subsample being 
drawn from each load, or churnings of butter from each of several vats 
of cream. If this cumulative variation among samples is the source of 
experimental error, the effort should be made to adjust k\ and Aj 2 so that 
maximum information may be obtained for the time and effort involved. 
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In table 10.20 there is another feature that deserves notice. The 
intraposition correlation within the plots is negative — 

12.48 — 48.82 

U ~ 12.48 + (3 - 1)48.82 = ~°' 3J 

One may ask if this correlation is significant. But the value of F, 12.48/ 
48.82 = 0.26, is not found in the table. The test is easily made, however, 
by this device: compare the reciprocal of F, 48.82/12.48 = 3.91, with 
the tabular values corresponding to the degrees of freedom in reverse ; that 
is, ^i = 6 and n 2 — 2. The 5% point, 19.33, suggests that our negative 
correlation may be merely an accident of sampling from a population in 
which p 7 = 0. Perhaps if one extended his investigation to other plots 
wuth the same cultural history, all on Clarion loam, of course, he would 
still find no evidence of a plot component of variance. 

EXAMPLE 10 26— What would be the effect on the variance of treatment rr^ans 
if the egg laying experiment were carried out with 4 cages per treatment, 5 hens per 
cage** Assuming (tabic 10 16) that s* = 297.8 and s e 2 = 27.8 remain unchanged, the 
variance of the cage mean would increase to s 2 / 5 s 2 — 87.4 instead of 575 9/10 
— 57.6. But the variance of the mean of 4 such cages would be 87 4/4 = 21 8, a 
decrease of 24% from the variance of the mean of the 2 cages used in the experiment. 
7’his indicates 24% more information from the same birds, or 24% less birds for the 
same information. 

10.13 — Tests of homogeneity of variances. From lime to time wc 
have raised the question as to whether two or more variances may differ 
significantly. For instance, in example 4.3 the variances in two lots of 
honeybee data seemed notably different, and you were cited to the present 
section for a test. Since the lots were equal in numbers of bees, the F-test 
is used with only this modification: calculate F as the quotient of the 
larger variance by the smaller, then double the tabular probabilities. 
The first part of the rule arises from inability to label either variance with 
the sources, “subsample means” or “individuals.” The second follows from 
the fact that F so calculated is always greater than 1, so that only the upper 
part of the distribution is used. In the two lots of bees the variances were 
0.0267 and 0.5889, hence 

F = 0.5889/0.0267 = 22.06, 

far beyond what is now the 2% point, 5.35. 

If n i is not equal to n 2 , assign n\ to the larger mean square. Consider 
the other sample of bee data in table 4.7. The two variances were 0.0653 
and 0.4133 with d.f . = 19 and 9. F then is 0.4133/0.0653 with n\ = 9 
and n 2 = 19, to be compared with the 2% point, 3.52. Since the sample F 
is larger than this tabular value, the null hypothesis may safely be rejected. 

Bartlett’s test of homogeneity (1) is available if there arc either 2 or 
more samples of variance, the tests indicated above being suitable for 2 
samples only. If the samples are equal in size, the test involves the com- 
parison of n times the logarithm of the average variance with the sum of 
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logarithms of the separate variances. In table 10.21 the method is applied 
to the 4 subsamples of s 2 in table 10.1 — subsamples known to have been 
drawn from a common source. The factor, 2.3026, is a constant (log*10) 
necessary because common logarithms are used. Chi-square, as first 
calculated, is slightly biased upwards. In this example, even the biased 
value shows less than average sampling variation in s 2 (table 9.1). It is 
necessary to make the correction only if x 2 lies close above one of the 

TABLE 10.21 

Computation of Bartlett’s Test of Homogeneity of Variance. 

Samples E^ual in Size 
Data from table 10.1. n — 4, k = 5 


Sample 

Sum of Squares, Sx 2 

Variance, j 2 

log s 2 

1 

472 

118 

2 07188 

2 

396 

99 

1 99564 

3 

616 

154 

2 18752 

4 

164 

41 

1 61278 



£r 2 = 412 

S log j 2 = 7 86782 

Mean variance = S 2 — 6i*/n — 412/4 = 103 

logs 2 = 2 01284 


n logs 2 = 4(2.01284) 

- 8 05136 



S log s 2 

= 7 86782 



Difference 

0 18354 



X 2 = 2 3026 (k — 1 ) ( n log S 3 — S log s 2 ) 

- 2.3026 (5 - 1) (0.1 8354) 

= 1 69, dj. « n - 1 = 3. 

Correction factor = C = 1 + — = 1 -f “ hi 042 

3 n(k - 1) 3(4)(5 - 1) 

Corrected x 2 - xVC = 1 69/1 1042 - 1.53 

critical tabular values, and then only if one wishes to get an accurate 
evaluation of P. 

For samples of differing size, the computation of chi-square is more 
complicated though following the same pattern. In table 10.22 the test 
of homogeneity is applied to the pig birth weights of table 10.12. Each 
Sx 2 is calculated from the corresponding SX 2 by deducting the usual 
(SX) 2 /k. The table requires no further explanation. 

Another anomaly in the pig birth weights now stands out: the differ- 
ing variances do not seem to have any physiological explanation. Both 
the largest and the smallest were in the 10-pig litters, and the others 
appear to occur at random. Coupled with the unexplained sizes of the 
birth weights (section 10.10), this leads to the conclusion that our sample 
contains little information. 

In addition to these biological peculiarities, there is a statistical feature 
that modifies some of the assumptions made in section 10.10. The estimate 
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TABLE 10.22 

Computation op Bartlett’s Test of Homogeneity of Variance. 
Samples Differing in Size 




Pig birth weights from table 10.12 



Sample 

Sx* 

Degrees of 
Freedom 
k- 1 

Reciprocal 
1/(A — 1) 

Variance 

j 2 

log s 2 

(k 1 )log i 2 

1 

8 18 

9 

o mil 

0 9089 

-0 04148 

- 0 3733 

2 

3 48 

7 

0 14286 

0 4972 

-0 30347 

- 2 1243 

3 

0.68 

9 

0.11111 

0 0756 

-1 12148 

-10.0933 

4 

0.72 

7 

0 14286 

0 1029 

-0 98758 

- 6.9131 

5 

0.73 

5 

0 20000 

0.1460 

-0 83565 

- 4.1782 

6 

0 24 

3 

0 33333 

0 0800 

-1 09691 

- 3 2907 

7 

1 97 

5 

0 20000 

0 3940 

— 0.40450 

- 2.0225 

8 

1 17 

3 

0 33333 

0 3900 

-0 40894 

- 1.2268 

n = 8 

17 17 

133 Sx* 

48 

= S(k- 

1.5746 
» _ 1 

A’(*-l)(logf») 

- -30.2222 


S 2 - Sx*/S(k - 1) - 17.17/48 = 0.3577 

(logS*)5(* - 1) = (— 0.44648)(48) = -21.4310 

X 5 - 2. 3026 [(log fi)S(k - 1) - S(k - 1 ) (log f 1 )] 
«= 2. 3026[ — 21.4310 - (-30.222)] 

= 20.24 


Correction factor, C - 1 + 3 ^ 1 ) ~ 


1 +■ 


1 


(3) (7) 

Corrected x 2 - 20.24/1.074 - 18.85** 


1.5746 - 


48 


dj 


1.074 

= ft - 1 - 7. 


of individual variance, s 2 = 0.36, is an average of variances that may be 
drawn from diffeient population values of cr 2 . Hence, the hypothesis 
tested in table 10.18 is not the simple one, cr m 2 = 0, but the more general 
one — “The litters are drawn at random from the same normal population; 
that is, there is no population difference in cither rn or cr.” Since we now 
have evidence that the second part of the hypothesis is untrue, we can 
no longer be certain about the first part. 

As for the pig litter data, the matter is unimportant: there was no 
understandable trend in either the birth weights or the variances, so that 
the sample appears uninformative. In other experiments there would be 
several considerations of interest. The first question that should be 
answered is, “What is the meaning of the different variances?” Pursuit 
of that lead might be more profitable than any findings about the means. 
If not, the natural conclusion would seem to be that differences among 
variances are only sampling phenomena, the ordinary test of the hy- 
pothesis, a m = 0, being valid. If that isn’t reasonable, the experiment may 
be divided into two or more parts, each with homogeneous variance, and 
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each part analyzed separately. Comparison of two means with different 
variances would be made as in section 4.6. 

10.14 — Summary. Analysis of variance has been applied to two or 
more groups of either the same or different sizes. The analysis furnishes 
estimates of population variances, (i) of individuals, (ii) of group means, 
and (iii) of pertinent components. It also provides for the F- tests of various 
null hypotheses; of a m = 0 if a is the same in all groups, and more gen- 
erally of a common normally distributed source of all subsamples. Chi- 
square is used to test the hypothesis of homogeneous variance. Finally, 
analysis of variance is a convenient method of computing intraclass 
correlation. 

The mathematical model for the analysis of variance of groups is this: 
the groups shall be random samples from normally distributed popula- 
tions. The probabilities associated with F are only slightly disturbed 
by moderate departures from normality. 

The mean square for individuals within subsamples is a valid estimate 
of experimental error only if the experiment is conducted so that there 
would be no intraclass correlation in uniformity trials. If Jots are kept in 
separate places their environments are likely different, with the conse- 
quent underestimate of experimental error and exaggeration of signifi- 
cance. 
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Chapter 11 


ANALYSIS OF VARIANCE WITH TWO OR MORE CRITERIA 
OF CLASSIFICATION 

11.1- — The use of knowledge about outcome. The investigator’s 
ability to forecast differential performances among his experimental 
units has been ignored in the foregoing chapter. The random allotment of 
individuals to groups is appropriate if there is no knowledge of outcome. 
But in many cases it is known that units treated alike will behave differ- 
ently: young male rats gain weight faster than females in identical 
environments, while plants in one section of a greenhouse bench produce 
less flowers than replicates in other sections. If units of both kinds are 
assigned to a lot for the purpose of evaluating the effect of a treatment 
the experimental error will contain not only the unpredictable variation 
due to sampling but also an increment from the predictable behavior of 
the individuals. It has been emphasized earlier in this text that experi- 
ments can be designed to eliminate from error the portion of variation 
which can be anticipated. For two treatments the pairing of chapter 2 
was effective. We are now to learn corresponding methods for three or 
more treatments. 

11.2 — Randomized blocks on a greenhouse bench. In the soybean 
experiment of section 6.2 it was indicated that of the 4 plants measured 
in any particular week one was chosen at random from each of 4 blocks. 
The block consisted of 5 plants in adjacent pots: these were arranged on 
the bench so as to warrant the assumption that the plants would grow 
alike. The layout is shown in figure 11.1. 

Though the 5 pots of any one block were placed so that environmental 
conditions (temperature, air currents, humidity, light) were as nearly 
identical as possible, these conditions were expected to be different for 
the several blocks: that they were can be seen by comparison of the heights 
in, for example, the fifth week. The rejection of this block variation from 
experimental error is the object of the statistical method now to be 
explained. 

For convenience of examination as well as to facilitate computation, 
the soybean data are assembled (table 11.1) by block and date. The 
randomization is no longer evident, but the weekly heights and the block 
differences are brought into prominence. Sums of squares corresponding 
to these two sources of variation arc to be separated from the total by 
means of analysis of variance. 

Computation follows the pattern already familiar. First, the sum of 
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Flo. 11.1 — Five pots in each of 4 blocks. The pots were arranged on the bench in the 
order shown. The upper figure for each pot is the week randomly chosen for measure- 
ment, the lower is the height of the soybean plant (centimeters). 


squares for the entire experiment is the sum of the squares of the 20 
heights, corrected for mean: 

(4) 2 + (3) 2 + (6) 2 + (7) 2 + (18) 2 + . . . + (38) 2 - (480) 2 /20 = 3,250 

This is entered in table 11.2 along with the associated 19 degrees of free- 
dom. 


TABLE 11.1 

Heights (Centimeters) of Soybean Plants in 5 Weeks. 
Four Blocks or Replications 


Block 

Week 

Sum 

Mean 

1 

2 

3 

4 

5 

1 

4 

18 

26 

38 

44 

130 

26 

2 

3 

19 

25 

35 

43 

125 

25 

3 

6 

18 

24 

28 

39 

115 

23 

4 

7 

13 

21 

31 

38 

110 

22 

Sum 

20 

68 

96 

132 

164 

480 


Mean 

5 

17 

24 

33 

41 


24 
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TABLE 11.2 

Analysis of Variance of Soybean Heights in Randomized Blocks Experiment 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Total 

19 

3,250 


Weeks 

4 

3,120 

780. 

Blocks 

3 

50 

16 67 

Remainder (discrepance) 

12 

80 

6 67 


Next, there is the sum of squares for the weekly means, 

(2 Q) 2 + (68) 2 + (96) 2 + (132) 2 + (164) 2 _ (480) 2 = 

4 20 

This is 4 Sy 1 as computed in table 6.3, 4 being the number of plants 
measured each week, and corresponds to the sum of squares for sub- 
samples in chapter 10. 

There is now a third sum of squares for block means: 

(130) 2 + (125) 2 + (115) 2 + (110) 2 (480) 2 _ ^ 

5 20 

Notice the divisors in the denominators: as usual, these are the numbers 
of heights whose sum is squared in each term of the numerator. 

In table 11.2 it is clear that the degrees of freedom for weeks and blocks 
fall short of the total, and that the same is true for sums of squares. The 
nature and utility of the remainder, discrepance , is to be discussed at some 
length. 

The growth of the plants is reflected in the large mean square for 
weeks: most of the variation among the 20 plants is due to normal increase 
in height. The much smaller mean square for blocks is attributable to 
the differences among rates of growth in the several positions on the 
bench. As for the mean square called discrepance, we shall find that it 
is associated with the sampling variation ordinarily thought of as experi- 
mental error, and that in this experiment it estimates the same variance 
as did s v x 2 of chapter 6. 

For those who prefer symbolical summaries, the data, computing 
instructions and analysis of variance in table 11.3 will be useful. 


11.3 — The nature of discrepance. In a two-way table like 11.1 each 
entry is characterized by two criteria, one determining the rows (blocks) 
and the other the columns (weeks). From the border means an expected 
value may be calculated for each cell of the table, and this may then be 
contrasted with the observed entry in a manner analogous to that of 
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TABLE 11.3 

Symbolical Representation of a 2-way Table With r Rows and c Columns. 
Computing Instructions and Analysis of Variance 



Column 


Row 

1 

2 

c 

Row Total 

1 

AT„ 

Ah, 

X\ e \ 

Si. 

2 

Xw 

A» 

A 2c 

s t . 



A'., 


s t . 

r 

in 

A"rt 

ire 

S r . 

Column total 

S.\ 

A. a S.j 

S.< 1 

1 

s 


Correction term = S 2 / re 


Sums of Squares 


Total: 

Rows: 

Columns: 

Discrepance: 


7 — S(X t1 ) 2 — correction 
,, 5, * + S 2 2 + • .Sr.* 

A = — - — — correction 

c 

^ S I* + £** + ... + 

C ss correction 


D = T - R - C 


Analysis of Variance 



Decrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Rows 

r - 1 

R 

R/(r - 1 ) 

Columns 

c - 1 

C 

C/(c - 1 ) 

Discrepance 

(r - 1)0 ~ 1) 

I) 

D/(r - l)(r 

Total 

TC — 1 

1 - R + C 4- D 



EXAMPLE 11.1 — Here are some data simulating the soybean experiment. 'I ney 
arc chosen for ease in computation. Analyze the variance. 
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Degrees of 

Sum of 

Mean 

Source of Variation ; 

Freedom 

Squares 

Square 

Age 

3 

60 

20 

Blocks 

2 

32 

16 

Discrepance 

6 

24 

4 

Total 

11 

116 



sections 9.5 and 9.8. The expected value is determined from a kind of 
regression equation: 

£ = x t . + z.j — x, 

in which the subscripts, *. and . j9 designate the row and column. To 
illustrate, take *. = 3 (third block) and = 4 (fourth week) of the 
soybean data. Here x t . = 23, x.j = 33 and x = 24; hence, for this cell 
of the table 

t = 23 + 33 - 24 = 32 

Now, X — £ = 28 — 32 = — 4 is the deviation of the regression value 
from that observed, and this is attributed to sampling variation. Finally, 
the sum of the squares of the 20 deviations is the same as discrepance in 
table 11.2. All of the regression values and deviations, calculated as 
above, are set out in table 11.4. 


TABLE 114 

Observed and Regression Values Together With Deviations, X — 
in the Soybean Experiment 











Week 









Block 


1 



2 




3 



4 




5 



X 

A A’ 

-X 

A' 

X 

X 

-X 

X 

X X- 

X 

X 

X 

A"- 

-X 

X 

XX 

-A' 

1 

4 — 

7 = 

— 3 

18- 

-19 

— _ 

-1 

26 

-26 = 

0 

38- 

-35 

=; 

3 

44- 

-43- 

1 

2 

3- 

6 = 

— 3 

19- 

-18 

= 

1 

25 

-25 = 

0 

35- 

-34 

= 

1 

43- 

-42 — 

1 

3 

6- 

4 = 

2 

18- 

-16 

= 

2 

24 

-23 = 

1 

28- 

-32 

= - 

-4 

39- 

-40 = 

-1 

4 

7- 

3 = 

4 

13- 

-15 

= - 

-2 

21 

—22— — 

1 

31- 

-31 


0 

38- 

-39 = 

-1 

1 

& 

■ *) 1 


-3) 2 

+ (- 

3) 1 

4- 

(2) 2 

+ 

(4) 2 + ( 

- 

l) 2 -f 


+ 

(- 

D 2 = 

- 80 



The notable features of table 11.4 arc: (i) The sum of the deviations 
in each row and also in each column is zero. This implies that the sum 
of the regression values in each row (or column) is the same as the sum 
of the observed, (ii) As already noted, the sum of the squares of the 
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deviations is identical with the sum of squares for discrepance, table 11.2. 
(iii) The regression values form a pattern of differences that is the same 
for every column. As examples, look at columns 1 and 3: 


Column 1 Column 3 


7 — 6 = 1 26 - 25 = 1 

6-4 = 2 25 - 23 =2 

4 - 3 = 1 23 - 22 = 1 


The succession of differences, 1, 2, 1, is the same in all the 5 columns. A 
similar statement is true for rows, the pattern of differences being —12, 
— 7, —9. (iv) The discrepance is caused by the failure of the observed 
values to follow such patterns of differences. This fact can be used to 
form a rough estimate of discrepance in 2-way tables; observe the differ- 
ences in the several columns — if they follow similar patterns, the dis- 
crepance is small, (v) It is evident from (i) and (iv) that analysis of 
variance of the regression values would result in the same sums of squares 
for weeks and blocks as those in table 11.2, but in zero discrepance, 
(vi) Because the sum of the c deviations in every row must be zero, only 
c — 1 of them are independent — the same argument that was used in 

EXAMPLE 11.2- — A set of data arranged for easy computation is the following: 


| Column 

Row | 1 2 

1 6 5 4 

2 15 10 8 

3 15 15 12 



Analysis of Variance 


Degrees of 

Source of Variation j Freedom 


Mean 

Square 


Rows 2 

Columns 2 

Discrepance 4 


63 
12 
2 5 


Calculate the sum of squares for discrepance by use of the deviations from regression. 

EXAMPLE 11.3— In the foregoing example deduct 4 from every entry, then re- 
calculate the analysis of variance As you may remember, this type of coding does not 
affect variance (section 5 4). Do the computation again after subtracting the mean, 10, 
from each datum: this kind of coding is often convenient if a calculating machine is 
not handy. 

EXAMPLE 11 4 — Turn forward to table 11 .5, calculate deviations from regression 
values in each of the 20 cells and verify the fact that the sum of their squares is 1,212. 
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EXAMPLE 11.5 — There is no better practice in the mechanics of these 2-way 
tables than constructing one for easy calculation. Try it. You will find that it is well 
to start with deviations from mean. 

EXAMPLE 1*1.6 Arrange a table similar to that of example 11.1 but satisfying 
this condition; the sum of squares for blocks shall be zero. 

EXAMPLE 11.7 — Make up another table having all its variation in discrepance: 
the sums of squares for both rows and columns equal to zero. 

EXAMPLE 11.8 — If you are interested in the mathematics, you will find it profit- 
able to verify the addition theorem for sums of squares, 

« cSx t * + rSt.* + Sd*, 

where x, 3 s ,. and £./ are deviations from St, and d is deviation from the regression value. 
Start with the equation, 

X « x t . -f St., — St + d 

sections 2.10 and 9.8. Similarly, there are r — 1 independent deviations 
(or degrees of freedom) in every column. Altogether, then, the degrees 
of freedom for the rc deviations are (r — 1 ){c — 1). 

In following sections we shall make various statements about proba- 
bility. These are all based on the hypothesis that the deviations in the 
2-way table constitute a random sample from a normally distributed 
population with m = 0. The mean square for discrepance is an estimate 
of the variance, a 2 , in this population. 

The hypothesis of independence of the deviations implies, among 
other things, that in the population there is zero correlation between 
these deviations in any pair of rows or columns. Of course, if you calculate 
some of these correlations in table 1 1 .4, you will find plenty of sampling 
variation from zero; but the values will tend to center on zero in the 
customary distribution of small sample correlations. The hypothesis of 
independence of deviations has been misunderstood by some, apparently, 
because the statement is often made that analysis of a 2-way table is not 
valid if the original entries are correlated. A glance at the regression 
values in table 11.4 will convince you that they are perfectly correlated, 
so that a high correlation is to be expected among the A'’s (remember 
common elements, section 7.5). One object of the analysis of variance 
is to segregate the correlated variation in the rows and columns from the 
independent part which estimates a 2 . 

11.4 — Mean squares as estimates of population variances. Tests of 
null hypotheses. An easy way to visualize the facts about estimates is to 
outline a 2-way table then fill the cells by random drawing from a normal 
distribution like table 3.1. That is the way table 11.5 was made, with 
slight modifications to insure ease of calculation. This particular random 
sample was selected from a number of others because it illustrates the 
average unusually well. 

The computations proceed as in section 11.2, the row T and column sums 
being easily supplied. Following the argument of section 10.2 it may be 
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TABLE 11.5 

Entries Drawn at Random From the Pig Gains of Table 3.1 


Outcome Level 

Treatments 

Means 

1 

2 

3 

4 

1 

15 

31 

30 

20 

24 

2 ! 

22 

11 

26 

45 

26 

3 

37 

30 

21 

36 

31 

4 

18 

31 

34 

49 

33 

5 

33 

37 

44 

30 

36 

Means 

25 

28 

31 

36 

30 


Analysis of Variance 





Degrees of 

Sum of 

Mean 

Source of Vaiiation 

F reedom 

Squaies 

Square 

Treatments 

3 

330 

110 

Outcome levels 

4 

392 

98 

Discrepance 

12 

1,212 

101 

Total 

19 

1,934 

101 8 


said that each of the four mean squares is an estimate of the population 
variance, known to be <j 2 = 100. The First three are independent of each 
other, while the fourth is a weighted mean of these first three. 

Table 11.5 may be looked upon as simulating a uniformity trial with 
swine. Each dummy treatment is applied to a lot of 5 animals, but in 
an actual experiment these would not be assigned to the lots wholly at 
random. Instead, the animal husbandman’s knowledge of outcome would 
be utilized. Four pigs would be selected for level 1 because they are 
judged capable of gaining at the same rate if treated alike. These 4 would 
then be randomly assigned to the treatments. Thus, the quadruplet 
at each outcome level is an expansion of the pair in the tobacco virus 
experiment of section 2.13: many treatments are now possible instead of 
only two. Experimental error is evaluated by discrepance — the failure 
of the 4 pigs in any level to gain in accord with the pattern set by the 
border means — just as the error in the two-treatment design was a measure 
of the failure of the pair differences to be identical with the mean differ- 
ence. 

Beginning in section 10.6, we learned that mean squares may be 
estimates of several components of variance. The same is true in 2-way 
tables, as indicated in the first part of table 11.6. One new feature 
appears — the row and column components are independent of each other, 
neither component appearing in the other mean square. Note: the co- 
efficient of the row component, cr# 2 , is equal to the number of columns, 
and vice versa. 
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TABLE 11.6 

Population Components op Variance Estimated By 
Mean Squares in a 2-way Table 


Source of 
Variation 

Mean Square 
Is an 

Estimate of 

Sample Statistics in 

Tabic 11.5 

Table 11.2 

Row ( R ) 

<r 2 4~ can 2 

. . 98-101 ft7 . 

r = 4, s R -= — = —0 7o 

c = 5, s r 2 = 2.00 

Column ( C ) 

<r 2 -f- r<rc a 

_ , 110-101 
r = 5, s c - =■ — = 1.80 

r = 4, r c 2 = 193.33 

Discrepance 

<r 2 

* 2 -101 

i 

r 2 - 6 67 


Note: c and r are the numbers of entries in the rows and columns respectively. 


The computations of the components of variance, much the same as 
those learned in the previous chapter, are applied in the other two parts 
of the table to the mean squares of tables 11.5 and 11.2. As would be 
expected in the uniformity data of the former, there are only small com- 
ponents for both rows and columns, and these represent no more than 
accidents of sampling. As a contrast, the component for age in the soy- 
bean experiment of table 11.2, sf z = 193.33, reflects the chief source of 
variation, the growth of the plants. 

You have doubtless already guessed that the hypotheses, a R 2 = 0 and 
of 1 = 0, are those tested by F. As an example, the test of the (wholly 
unrealistic) hypothesis that there is no change of soybean height with 
age is 

r + rs c 2 mean square for treatments 
s 2 mean square for discrepance 

= 780/6.67 = 117, df. = 4 and 12 

Significance is obvious. There may be interest, also, in learning if the 
block differences are attributable to sampling from a uniform population 
environment. F — 16.67/6.67 = 2 50, d.f. = 3 and 12, F os — 3.49: the 
evidence against this null hypothesis is not convincing. 

EXAMPLE 11 9 — Hoffman (11) and Lowe (14) investigated the flavor of the 
breast muscle of Buff Orpington poultry subjected to the following 4 treatments: 

a Drawn 2 hours after killing, then frozen and stored. 

b. Frozen 2 hours after killing, stored 10 days, thawed, drawn, refrozen, and stored. 

c. Frozen 2 hours after killing, stored 30 days, thawed, drawn, refrozen, and stored. 

d. Frozen 2 hours after killing, stored 90 days, thawed, drawn, and refrozen. 

The chief distinction was in the period elapsing between killing and drawing. All were 
stored 90 days before cooking started. There were 7 replications of the 4 treatments — 
28 birds in all. Four judges scored the flavor 

Owing to parallel treatments not here reported, only one chicken could be cooked 
and scored per day. It was anticipated that the flavor might change during this 28-day 
period, hence one replication was completed each 4 days, the 4 treatments being taken 
in random order during each block of days The sums of the 4 scores per bird are 
shown in the table. 
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Treatment 



Block of 4 Days 



1 

2 

3 

4 

5 

6 

7 

a 

20 

18 

16 

17 

16 

14 

16 

b 

18 

18 

16 

20 

15 

10 

16 

c 

20 

18 

16 

18 

18 

15 

13 

d 

20 

16 

20 

17 

17 

14 

19 

Show that, for treatment, F = 

0.80. There is no evidence 

that the flavor was 


affected by the time elapsing before drawing. For blocks, F = 4.97. Clearly the flavor 
did decline during the 28-day period of testing. 

EXAMPLE 11 10 — The following arc average daily gains of 4 lots of swine re- 
ceiving different rations (20). The animals were divided into 10 outcome groups. It 
was thought that the four in each group would gain at equal rates if fed the same. 


Outcome 

Group 

Ration 

1 

2 

1 

1 40 

1 31 

2 

1 79 

1 30 

3 

1.72 

1 21 

4 

1 47 

1 08 

5 

1 26 

1 45 

6 

1 28 

0 95 

7 

1 34 

1 26 

8 

1 55 

1 14 

9 

1 57 

1 25 

10 

1 26 

1 00 

Sum 

14 64 

11 95 

Mean 

1 464 

1 195 


Number 



3 


4 

Sum 

1 

40 

1 

96 

6 

07 

1 

.47 

1 

77 

6. 

.33 

1 

37 

1 

62 

5 

92 

1 

15 

1 

76 

5 

46 

1 

22 

1 

88 

5 

81 

1 

48 

1 

50 

5 

21 

1 

31 

1 

60 

5 

51 

1 

27 

1 

49 

5 

45 

1 

.22 

1 

77 

5 

81 

1 

36 

1 

27 

4 

89 

13 

25 

16 

62 

56 

46 

1 

325 

1 

662 




One pig from each outcome group was randomly assigned to each ration. So far as the 
animal husbandman’s judgment is good this scheme insures a balance of high and low 
gainers in the lots On the other hand, since every pig has the same chance of receiving 
any one ration, there is no bias in either the means or the experimental error. Analyze 
the variance as follows: 



Degrees of 

Sum of 

Mean 

Source of V ariation 

Freedom 

Squares 

Square 

Total 

39 

2 2309 


Outcome groups 

9 

0 4074 

0 0453 

Rations 

3 

1 1986 

0 3995** 

Discrepance (error) 

27 

0 6249 

0 0231 


For rations, F ~ 0.3995/0 0231 = 17.3, F n — 4 60. In the test of outcome groups, 
F « 0 0453/0.0231 = 1.96, F 0 s = 2.25. The increase in efficiency over random 
assignment to lots will be evaluated in example 11.15. 

EXAMPLE 1 1.11 — Try computing the analysis of variance of table 11.5 by use of 
deviation of both items and means from the overall mean, 30. Remember that the 
variance of items, estimated from means, is k times the variance of the means: for 
row means, k *» 4, for columns, 5. 
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EXAMPLE 11.12 — Calculate the sum of squares for columns in example 11.11 by 
taking the deviations of the column sums, 125, 140, 155, and 180 from their mean, 150. 
You will observe that the variance of items is one-fifth the variance of sums. It is well 
to keep in mind the relations, 

^(variance of means) — mean square = (variance of sums)//t 

EXAMPLE 11.13 — The facts of the two preceeding examples are hard for the 
beginner to keep in mind. In tables 10.11, 10.14, and 10.16 the contrast between 
mean square and variance was emphasized. The following statements about table 
11.6 may help: (i) The variance of means, in the rows lor example, is 

b 2 4 cs R 2 )/c - s'/c + 

(ii) The variance of the row sums is 

c{s z -f CS R 2 ) = CS 2 4- C 2 Sr 2 

11.5 — Analysis of variance applied to 2 treatments in randomized 
blocks. The comparison of individuals in section 2.13 is a special case 
of the 2-way table, and may be thought of as a randomized blocks experi- 
ment with 2 treatments. The tobacco virus data of table 2.2 are copied 
in table 11.7. Variance is then analyzed by the methods of the present 
chapter. 

TABLE 11.7 

Numbers of Lesions on Halves of 8 Tobacco Leaves 
Two Treatments Replicated on 8 Plants 
Copied from table 2.2 



Treatments 

Diffei ence 

Sum 

Plants 

X\ .V 2 

A', - X x 

X\ 4 A r i 

1 

9 10 

-1 

19 

2 

17 11 

6 

28 

3 

31 18 

13 

49 

4 

18 14 

4 

32 

5 

7 6 

1 

13 

6 

8 7 

1 

15 

7 

20 17 

3 

37 

8 

10 5 

5 

15 

Sum 

120 88 

32 

208 

Mean 

15 11 

4 

13 


Analysis of Variance 


Degrees of 

Source of Variation Freedom 

Sum of 
Squares 

Mean 

Square 

Treatments 

1 

64 


Plants 

7 

575 

82 2 

Discrepance 

7 

65 

9.29 

Total 

15 

704 



lest for treatments: F = 64/9.29 = 6.89 — (2 63) 2 — l 2 (section 2.13) 
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The sum of squares for treatments can be got in the usual fashion, 

(120) 2 + (88) 2 (208) 2 _ 

— s ir “ 64 ' 

or by the special formula mentioned in section 10.5, 

(120 - 88) 2 

~ 16 ~ ~ 64 

As regards discrepance, its variance may be inferred from that of the 
differences, 18.57, in table 2.2. Since the variance of differences is an 
estimate of 2<r 2 , then a 2 is estimated by 18.57/2 = 9.29. The two methods 
of computation, therefore, lead to the same statistic. 

As has been noted before, if n\ = 1 d.j. then t 2 — F as indicated in the 
last line of table 11.7. 

The method of the present chapter leads to one result not available 
in chapter 2: the estimate of variance, 82.2, based on plant means. This 
large and highly significant mean square is the justification of the ran- 
domized blocks design — the investigator circumvented the variation from 
plant to plant by trying the preparations on halves of the same leaf. 

The efficiency of this experiment was discussed in section 3.10, but 
reference had to be made to the present one for more accurate evaluation. 
Yates (28) gave an easily remembered rule for reconstructing the table 
of analysis of variance so as to arrive at an unbiased estimate of the 
variance as it would be if the sampling of the half leaves on the 8 plants 
were random: substitute the error variance, 9.29, for the mean squares 
of both error (7 d.f.) and treatments (1 d.f.), then calculate the corre- 
sponding sum of squares, (8) (9.29) = 74.3. In table 11.8, this is added to 
the sum of squares for plants (blocks) and the new total divided by the 
total degrees of freedom. Thus, the information in this experiment as 
compared to the random groups of chapters 4 and 10 is 466%. 


TABLE 11.8 

Reconstruction of Table 11.7 to Get Estimate of Variance in Random Sample 
Block Sum of Squares and Error Mean Square, in Boi dface Type, Copied 

From Previous Table. 



Degrees of 

Sum of 

Mean 


Source of Variation 

Freedom 

Squares 

Square 


Plants (blocks) 

7 

575 



Error -f- treatments 
Total 

8 

74 3 

9 29 


(estimated random) 

15 

649 3 

43 3 



Efficiency of blocks as compared to random sampling ~ 43.3/9.29 = 466% 
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This is the last of the series of sign posts that have guided you to the easier portions of the text. 
The most used methods of biological statistics have been presented. Any of the following chapters 
may now be selected for study , or you may wish to examine more in detail some of the earlier secttons 
omitted. J 


EXAMPLE 11.14 — In example 2.28 you compared the gains in weight of 10 pairs 
of litter mates (swine). Analyze the variance of the 20 gains, as follows- 


Source of Variation 

Degrees of Freedom 

Mean Square 

Treatment 

1 

14.45 

Litter 

9 

6 05 

Discrepance 

9 

3 56 


For treatments, compare F with f 2 . Compare the mean square for discrepance with the 
variance of the differences, 7.12, in the earlier example. 

EXAMPLE 11.15— Show that the experiment of example 11.10 had the efficiency 
122% as compared to the random assignment of animals to the treatments. This 
means that between 12 and 13 pigs would have been required in each random group 
to achieve the same accuracy as was got by 10 outcome groups. 


11.6 — Randomized blocks in a field trial. This design has been 
widely used in field plot experimentation. The fact that neighboring 
plots tend to yield alike makes it desirable to try each treatment (or 
variety) on one of a block of plots lying as closely together as feasible. 
Uniformity of fertility throughout the block is the ideal. 

The block is divided into a number of plots equal to the number of 
treatments. The treatments are then assigned to the plots at random to 
insure to each the equality of opportunity to occupy any superior position. 
To equalize the hazard of position in the experiment as a whole, two or 
more blocks or replications are laid out, each containing all the treatments. 
In field experiments from 3 to 10 replications are common, the number 
depending on the precision required. 

Figure 11.2 is the plan of a field layout for testing 4 strains of Gallipoli 
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Fig. 11.2 — Field plan of an experiment with four strains, A, B , C, and D, of Gallipoli 
wheat. Yields are in pounds per plot. 
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wheat in each of 5 blocks (9). Narrow plots extending the length of the 
block are preferable: there may be considerable variation in fertility from 
end to end of the block, but the aim is to have this affect all the plots alike. 
The blocks should not be too narrow, but should approach squareness. 

The data together with the analysis of variance are recorded in table 
11.9. Notice the coefficient of variation, C = s/x = a/2. 19/32. 8, only 


TABLE 11.9 

Yields of Four Strains of Gallipoli Wheat Planted in Five Randomized Blocks 

Pounds per Plot 


Strain 



Block 


i 

Sum 

Mean 

1 

2 

3 

4 

5 

A 

32 3 

34 0 

34 3 

35 0 

36 5 

172 1 

34 4 

B 

33 3 

33 0 

36 3 

36 8 

34 5 

173 9 

34 8 

C 

30 8 

34 3 

35 3 

32 3 

35 8 

168 5 

33 7 

D 

29 3 

26 0 

29 8 

28 0 

28 8 

141 9 

28 4 

Sum 

125 7 

127 3 

135 7 

132 1 

135 6 

656 4 

32 8 


Analysis of Variance 


Source of Variation 

Degrees of 

Freedom 

Sum of 
Squares 

Mean 

Square 

Blocks 

4 

21 46 

5 36 

Strains 

3 

134 45 

44 82** 

Error 

12 

26 26 

2 19 

Total 

19 

182 17 



4.5%, indicating unusually accurate field plot experimentation. Another 
interesting feature is the similarity of the first three means. One suspects 
that the highly significant F, indicated by the double asterisk on the treat- 
ment mean square, is attributable largely to the small yield of D. It has 
been noted before (section 10.4) that while exact tests of differences be- 
tween pairs of treatment means were not planned in the design of this 
experiment, adequate information springs from two sources: (i) the array 
of means, each an unbiased estimate of strain yield; and (ii) the fiducial 
limits of these means. The latter (section 3.8) are calculated from 

t osJf = 2.179\/2.19/5 = =*=1.4 pounds per plot 

The upper limit for D is 28.4 + 1.4 = 29.8 pounds, while the lower for 
the next mean in the array is 33.7 — 1,4 = 32.3 pounds. Since there is 
no overlapping of the intervals, the evidence is quite convincing that the 
means for C and D are not samples from the same population. On the 
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contrary, the intervals for the other 3 means indicate no distinctions among 
the corresponding populations. 

An experiment like this might have been designed to test three new 
strains, A, B, and C, against the standard, D. In that case, the 3 degrees 
of freedom for strains would naturally fall into two parts; 2 d.f. for differ- 
ences among the 3 new strains and 1 for the contrast, old vs. new. The 
sum of squares for the first part is 

(172.1) 2 + (173.9) 2 + (168.5) 2 (514.5) 2 

5 15 = 3-02 


For the single degree of freedom, the sum of squares is (section 10.8) 


(514. 5) 2 (141.9) 2 (656.4)2 

15 + 5 20 


131.43 


Note that the sum of the two parts is the strain sum, 134.45. For this 
design the analysis in table 11.10 would be appropriate. 


TABLE 11.10 

Analysis of Hypothetical Experiment Designed to Contrast 3 New Strains, 
A , B, and C, With the Standard, D. The Data Are Those in Taplf 1 1 9. 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Strains A, Z?, and C 

2 

3 02 

1 51 

Old vs new 

1 

131 43 

131 43** 

Error (table 11.9) 

12 


2 19 


The moral is plain: exact tests of significance are available for com- 
parisons built into the experimental structure, but no definite probability 
statements can be made about contrasts suggested by the data. Further 
information about individual comparisons will be found in chapter 15. 

Some people still question the advisability of random arrangements 
for field plots. There are many arguments pro and con, and they are all 
complicated. One obstacle to clear understanding is the unlimited 
number of nonrandom (that is, systematic) arrangements: each must be 
considered separately. As examples, the “knight’s move” was investi- 
gated by Tedin (23), and the “half drill strip” by Gossct (“Student”) (22). 
For a more general discussion, see (8). 

The simplest and least defensible of systematic layouts is the same 
succession, ABCD for example, in all the blocks. If there is any uniform 
gradient in fertility the result may be that, in every application, A is on 
better soil than the other treatments. This is not common, but I have 
seen it happen. Another difficulty with this scheme is that experimental 
error is biased downwards with a consequent exaggeration of significance. 
To see why, recall two facts: (i) plots equally distant tend to differ by the 
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same amount; and (ii) the estimate of error is based on the failure of yield 
differences, such as A — C, to be identical in the several blocks. Hence, 
if A and C are the same distance apart in every replication, the differences 
are likely to be more alike than if the 2 treatments are randomized. 
Another generally applicable argument is that A and £, being contiguous, 
are subject to less error than are A and D : discrepance, an average of these 
errors, is not appropriate for either comparison. Randomization tends to 
produce an unbiased error equally suitable for testing all differences, 
and it is a necessary condition for exactness in the probabilities associated 
with fiducial statements and tests of null hypotheses. 

The theory of experimentation on which this text is founded specifies 
erroneous inferences in a predetermined percentage (5%, for instance) 
of trials. Any particular random arrangement of plots applied to an area 
chosen for the experiment may turn out to be the worst possible. The 
experimental approach is not designed for him who makes a single assay: 
it is only the average of many trials that is guaranteed to approach the 
true value. During a life of research the investigator is assured that, 
provided he makes a fresh randomization for every trial, he will be led 
to correct probability statements in the chosen fraction of his experiments. 

The reader who wishes to learn more about randomized blocks may 
consult references (7) section 48, (8) chapter 4, and (5) chapter 2. There, 
more complicated designs also are discussed. 

Accidents often result in the loss of data in some of the cells of a 2-way 
table. Crops may be destroyed, animals may die, or errors may be made 
in recording. One missing item can be supplied with the aid of a formula 
proposed by Allan and Wishart (1) and modified by Yates (26): 


where 


tT+bB-S 

' ~ (/-i)(i-i) 

t — number of treatments 
b = number of blocks 

T = sum of items with same treatment as missing item 
B = sum of items in same block as missing item 
S — sum of all observed items 


As an example, suppose the item 29.3 pounds were missing from table 1 1.9, 
block 1, strain D. Omitting this item from the sums for block 1, strain D 
and total, 

B = 96.4, T = 112.6, S = 627.1, b = 5, t = 4 
Then, X = L(4) (1 12.6) + (5)(96.4) - 627.1]/(3)(4) = 25.4 


This value would be entered in the table as the yield of the missing plot. 
The analysis of variance would then proceed as usual, with this one 
modification: the number of degrees of freedom for total and error would 
each be decreased by unity, becoming 18 and 11 instead of 19 and 12. 
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The theory is that X is so determined that the error sum of squares is 
as small as possible. This means that the supplied item is its own expected 
value (section 11 . 3 ), its deviation from expected being zero. Nevertheless, 
the deduction of a degree of freedom for the missing datum makes the 
estimate of error unbiased. But the treatment mean square is biased 
upward, so that the F-value tends to be exaggerated somewhat. Hence, 
it would be wise to think of a significant F as nearer to the 1 % than to the 
5% point. 

If two or more data are missing, very good results can be got by an 
iterative method. If X\ and X<l denote two missing values, first enter in 
the table a reasonable value for Xi (the experimental mean, for example, 
or the mean of the treatment and block containing A r 2 ), then substitute in 
the formula for X\ values of T, B , and S which include AY Now set in the 
table the calculated X h remove A 2 , recompute T, B , S, and substitute in 
the formula to obtain a better approximation to AY Repeat the process 
for an improved AY After a few cycles the new estimate will be practically 
the same as the old. The data are then ready for analysis of variance with 
two degrees of freedom deducted from total and error. 

For practice, turn to example 11.2 and assume that data are missing 
from row 2, column 2 (A”i) and from row 3, column 1 (Aj). Enter 18 for 
A 2 because both row and column totals are large. The formula gives 
A i = 11.5. Supply this value in row 2, column 2, eliminate the entry for 
AY then again substituting in the formula calculate A' 2 = 16.9. Repeti- 
tion of the process results in A’i = 11.8 and Xz ~ 16.8. The analysis of 
the resulting variance is, 


Source of Variation 


Degrees of Fieedom 


Mean Square 


Rows 

Columns 

Discrepance 


2 

2 

2 


72 
16 
3 2 


The ease of analyzing experiments with missing data may deceive you 
into thinking that such losses are unimportant. Actually, the loss of 1 plot 
in 12 means the loss of one-twelfth of the information planned in the 
experiment. The only objective of the foregoing methods is to salvage as 
much as possible of the remaining eleven-twelfths. 


EXAMPLE 11.16 — In the Gallipoli wheat trial, calculate the efficiency of the ran- 
domized blocks as compared to unstratified random sampling. Ans. 131%. Stratifica- 
tion produced greater precision than another replication would have accomplished in 
a completely random arrangement. 


EXAMPLE 11.17 — In a cooperative seed treatment trial (16) the numbers of 
plants emerging from 100 planted soybean seeds, Kanro variety, were reported for 
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each plot of a randomized blocks experiment conducted by the South Carolina Truck 
Experiment Station: 


Block 

Treatments 

Untreated 

Arasan 

Spergon 

Semesan, Jr. 

Fermate 

1 

92 

98 

96 

97 

91 

2 

90 

94 

90 

95 

93 

3 

88 

93 

91 

91 

95 

4 

87 

89 

92 

90 

95 

5 

89 

95 

90 

94 

97 


Analyze the variance. 


Source of Variation 


Treatment 

Blocks 

Error 


Degrees of Freedom 


4 

4 

16 


Mean Square 


20 95 * 
12 45 
5 412 


If you compare the mean of treated with the mean of untreated plots you will get 
93.3 — 89.2 = 4.1 plants, with the corresponding mean square, 67.24. Hence, 
F « 12.42**. 

EXAMPLE 11.18 — Following are yields of corn (bushels per acre at 15% moisture) 
on Marshall silt loam in Iowa (1944): 


Block 

Plowed 

Listed 

Sub-surface 

Tilled 

Disked 

Total 

1 

102 

93 

75 

78 

348 

2 

94 

96 

59 

71 

320 

3 


88 

73 

66 

227 

Total 

196 

277 

207 

215 

895 


Supply a number for the missing yield (Ans. 95 bushels per acre), then analyze the 
variance. Ans. F is 17.5 with dj. = 3 and 5. 

EXAMPLE 11.19 — In the foregoing example the yield, 102 bushels per acre, was 
estimated from the undamaged portion of a plot whose stand had been partly destroyed 
during cultivation. Eliminate this datum from the table above, then supply yields for 
two missing plots by the recursion method. Ans. In block 1, 100.7 bushels per acre, 
and in block 3, 94.4. The analysis of variance results in the following degrees of 
freedom and mean squares: 

Treatment 3 588 49 

Error 4 43.17 

EXAMPLE 11.20 — After supplying the two missing data of example 11.19, calcu- 
late regression values for the 12 yields (section 11.3). The data supplied should be 
approximately equal to their own X ’s, their deviations being zero. These missing plots 
contribute nothing to error, hence the loss of a degree of freedom for each. 

EXAMPLE 11.21 — The mathematically inclined will wish to derive the formula 
for a missing item. Substitute X for the yield in the blank cell, then carry through the 
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computations in the usual way. At the end, the sum of squares for error will contain 
terms in both X and X*. Calculate the value of X which minimizes the error. 

EXAMPLE 11.22 — Any systematic arrangement is one of the possible random 
arrangements. Suppose your drawing led to ABCD in every block, what would you do? 

11.7 — The latin square. This experimental design perfected by 
Fisher (7) is especially well adapted to the field trial of usually not more 
than eight treatments or varieties. Interest in the design is by no means so 
limited, however, since it has remarkable theoretical as well as practical 
implications (8, chapters 5 and 7). The number of replications must be 
the same as the number of treatments. The field, which should be a 
rectangle somewhat approximating squareness, is divided into two sets of 
blocks at right angles. One set is usually designated rows , the other 
columns. If there are k rows, then there are k columns and k 2 plots. Upon 
the random assignment of the treatments to the plots there is placed this 
restriction: each treatment must occur once and only once in each row 
and each column. 

The design is illustrated by table 11.11 (13) where the treatments 


TABLE 11.11 

Yields (Grams) of Plots of Millet Arranged in a Latin Square 
Spacings: A, 2-inch; /?, 4; C, 6; D, 8; E, 10 






Column 





Row 

1 

2 

3 

4 

5 

Sum 


1 

B' 257 

E* 230 

A: 279 

C: 287 

D: 202 

1,255 


2 

D- 245 

A: 283 

E • 245 

B- 280 

C: 260 

1,313 


3 

E. 182 

B- 252 

C. 280 

D • 246 

A: 250 

1,210 


4 

A: 203 

C: 204 

D. 227 

E: 193 

B: 259 

1,086 


5 

C 231 

D: 271 

B • 266 

A - 334 

E: 338 

1,440 

Sum 

1,118 

1,240 

1,297 

1,340 

1,309 

6,304 



Summary by Spacing 




A: 2" 

B: 4" 

C. 6" 

D 8" 

E: 10" 


Sum 


1,349 

1,314 

1,262 

1,191 

1,188 

! 6,304 

Mean 

269 

8 262 8 

252 4 

238 2 

237 6 

252.2 


Analysis of \ ariance 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Total 

24 

36,571 


Spacings 

4 

4,156 

1,039 

Rows 

4 

13,601 

3,400 

Columns 

4 

6,146 

1,536 

Error 

12 

12,668 

1,056 
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A y B, C, D , and E are located as they were in the field. Since each row 
and column contains every treatment, differences among either set are 
attributed to soil variation. It is clear that the rows and columns should 
be not far from the same width in order to share alike in the soil heteroge- 
neity. This results in desirably compact, almost square plots. Since the 
plots are usually small, the soil variation in them is small. Unlike the 
randomized block design whose blocks need not be contiguous, the latin 
square loses its advantages if the plots are not continuous in the field. 
On this account, the square is somewhat less flexible than the blocks. 
Also, complete loss of data for a treatment (or row or column) increases 
the difficulty of analysis (29). Under suitable conditions, however, the 
square is, on the average, about 20% more efficient than the blocks. 

The data in table 11.11 result from an experiment on the effect of 
spacing millet plants in the row. The letters denote 2-, 4-, 6-, 8-, and 10- 
inch spacings. This table shows not only the planting arrangement, but 
also the plot yields and the sums for rows and columns. In the last two 
lines are the sum and mean for each spacing. 

In the calculations below, leading to the analysis of variance, there is 
only one addition to those heretofore described, that yielding the sum 
of squares for the columns. 


1. Correction: (6,304) 2 /25 = 1,589,617 

2. Total: (257) 2 + . . . + (338) 2 - 1,589,617 = 36,571 


3. Spacings: 


(1,349) 2 + . . . + (1,188) 2 
5 


- 1,589,617 = 4,156 


4. 


Rows: 


(1,255) 2 + . . . + (1,440) 2 
5 


- 1,589,617 = 13,601 


5. 


0 1 (1,11 8) 2 + . . . + (1,309) 2 

Columns: 


6. Remainder: 12,668 


1,589,617 = 6,146 


The mean squares arc estimates of these population variances: 


Spacings 

Rows 

Columns 

Error 


cr 2 + 5<r* 2 
a 2 + 5<r* 2 
a 2 + 5cr e 2 


The testing of null hypotheses about spacings, rows, and columns clearly 
offers no difficulties. 

From the mean squares one concludes that the spacing effect is negli- 
gible, the variation being just that to be expected in sampling from a 
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single normal population of yields. But the design of the experiment 
included a feature not yet brought into the analysis — the regression of 
yield on spacing. The summary by spacing in table 11.11 shows mean 
plot yield decreasing with wider spacing. It appears, then, that the sum 
of squares for spacings, 4,156, contains a considerable portion attributable 
to linear regression, and that it will be informative to subdivide this sum 
accordingly. We shall need Sx 2 , Sxy, and Sy 2 in the manner of chapter 6. 
The latter is already calculated, 4,156. The other two are conveniently 
computed from means, then expanded to the item basis (characteristic of 
sums of squares and mean squares) through multiplication by k = 5. 

&t 2 = (2)2 + . . . + (10) 2 - (30) 2 /5 = 40 

Sxy = (2) (269.8) + . . . + (10)(237.6) _ t 30 ^ 1 ’ 260 : 8 ) = _ 178 

5 

The desired sums of squares and products, therefore, are 200 and — 890. 
Finally, -Sy is separated into the two parts: (i) attributable to regression, 
(SxyY/Sx 1 = ( — 890)7200 = 3,960; and (ii) deviations from regression, 
4,156 — 3,960 = 196. The revised analysis of variance (omitting rows 
and columns) is in table 11.12. 


TABLE 11.12 

Analysis of Regression of Spacing Means on Width of Spacing. 
Millet Experiment 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Regression 

1 

3,960 

196 

3,960 

Deviations 

3 

65.7 

Error (table 11.10) 

12 


1,056 


For regression, F *= 3,906/1,056 = 3.70, F 05 533 4 75 


It now appears that most of the variation in yield Is associated with 
regression, deviations from regression being trivial. To put it another way, 
the major portion of the sum of squares for spacings belongs to the single 
degree of freedom for regression. Even so, this regression is not significant 
so that there may be no relation between spacing and yield in the popula- 
tion. It is interesting, however, that, in the sample, the notable increase 
of yield per plant due to freedom from competition in the wider spacings 
failed to compensate for the fewer plants. Notice the sizable coefficient 
of variation, C — 1 ,056/252.2 — 13%, which may explain the non- 

significance of the rather striking regression. 
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Missing data in a latin square can be supplied in the manner explained 
in the preceding section. The formula is now 

k{R + C + T) - 2S 

(A - 1)(* ~ 2) ’ 

where k = number of rows (or columns or treatments) 

R — sum of items in same row as missing item 
C = sum of items in same column as missing item 
T = sum of items with same treatment as missing item 
S — sum of all observed items 


For an illustration, assume that in example 11.23 the plot yield with 
treatment A in row 1 is missing. We have R = 52, C = 43, T = 28, 
5 « 163, k = 3, 


3(52 + 43 + 28) - 326 
(3 ~ 1)(3 - 2) 


21.5 


EXAMPLE 11.23 — Here is a latin square for easy computation. Treatments are 
indicated by A , B, and C. 


! 


Columns 


Rows 

1 

2 

3 

1 

B: 23 

A • 17 

C: : 

2 

A • 16 

C: 25 

B’ 1 

3 

C: 24 

B: 18 

A: 1 


The mean squares are: rows, 21 ; columns, 3; varieties, 93; remainder, 3. 


EXAMPLE 11.24 — Code the items in the latin square of the last example by 
subtracting some easy number like 10 or 12, then duplicate the analysis of variance. 

EXAMPLE 1 1 .25 — Do the analysis of variance in example 1 1 .23 by use of deviations 
from mean. 

EXAMPLE 11.26 — In the manner of section 11.3 insert regression values in the 
table of example 11.23. Each will be the sum of four parts: (i) the row mean, (ii) the 
column mean, (iii) the treatment mean, and (iv) minus twice the general mean. The 
sum of the 9 deviations is zero, while the sum of their squares is 6, the remainder in the 
table of analysis of variance. 

EXAMPLE 11.27 — You can learn a lot about latin squares by constructing one 
for easy computation like that of example 11.23, especially if you start with sets of 
deviations for row, column and variety and put in the expected numbers first. 


EXAMPLE 11.28 — A 7X7 latin square was laid out to test the effect of various 
fertilizers on the yield of potatoes. Here is the field plan with the plot yields (bushels 
per acre). The letters specify the treatments. 
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Row 




Column 



1 

2 

3 

4 

5 

6 

7 

1 

C : 344 

B: 428 

A: 423 

D: 432 

E: 449 

F: 449 

G: 410 

2 

D: 367 

A: 430 

F: 443 

C: 387 

B: 447 

G: 426 

E: 372 

3 

B: 317 

C: 389 

D: 425 

A: 389 

G: 408 

E: 397 

F: 384 

4 

E: 303 

D: 393 

G: 442 

F: 427 

C: 386 

B: 358 

A: 342 

5 

A: 291 

E: 423 

B: 412 

G: 392 

F: 364 

C: 341 

D: 333 

6 

F: 261 

G: 374 

E: 398 

B: 347 

D: 323 

A: 284 

C: 273 

7 

G: 258 

F: 376 

C: 404 

E: 317 

A: 234 

D: 250 

B: 233 


Compute the analysis of variance: 


Source of Variation 

Degrees of Freedom 

Mean Square 

Rows 

6 

14,143 

Columns 

6 

10,831 

Treatments 

6 

1,936* 

635 

Error 

30 


11.8 — More than one item in the subclasses. Some of you are asking: 
“Is discrepance always a valid estimate of error? Is there some logical 
foundation for this in experimentation?” To the first question, the 
answer is, “No.” The second requires some elaboration. In the experi- 
ments hitherto presented, there is both logical and experimental justifica- 
tion for the use of discrepance as error. On the logical side, randomization 
together with rather uniform conditions leave in discrepance no variation 
except such as cannot be controlled. A long series of studies of uniformity 
trials, in field crops especially, have furnished the experimental evidence. 
In such standardized experiments discrepance can be relied upon to 
measure experimental error. 

If you are beginning some investigations where nothing is known of 
the validity of discrepance as error, you should provide at least two and 
preferably three determinations of the variate in each subclassification. 
The method is illustrated in table 11.13, where three such determinations 
were made (17). 

This was a greenhouse experiment to learn the effects of two kinds of 
soil treatments. Each combination of treatments, as for example “soil 
plus straw” with “carbon dioxide gas,” was laid down in three pots. To 
eliminate the possibility that position might affect the pots differentially 
the 48 pots were placed at random in the space provided in the green- 
house. Don’t overlook this precaution. If you place together three pots 
containing one combination of treatments, and in a second place those 
with another combination, the effects of position and treatment will be 
confounded and the three pots may be worth no more than a single de- 
termination. 



276 


Statistical Methods 


Section 11.8 


TABLE 11.13 

Yield of Wheat in 48 Pots. Greenhouse Experiment With Two Series of Soil 
Treatments, Three Pots For Each Combination 
Grams 





Chemical Treatment 



Humus and 






Sum, 

Yield 

Fertilizer 




CO a 

Carbonic 

12 

per 

Treatment 

Pot 

None 

Ji + O 

Gas 

Acid 

Pots 

Pot 

None 

1 

21.4 

20.9 

19.6 

17.6 i 




2 

21.2 

20 3 

18 8 

16 6 




3 

20 1 

19.8 

16.4 

17.5 




Sum 

62 7 

61.0 

54 8 

51.7 

230.2 

19 2 

Straw 

1 

12.0 

13.6 

13.0 

13.3 




2 

14 2 

13 3 

13 7 

14 0 




3 

12 1 

11 6 

12 0 

13 9 




Sum 

38.3 

38 5 

38 7 

41 2 ! 

156 7 

13 1 

Straw -f POi 

1 

13.5 

14 0 

12 9 

12 4 




2 

11.9 

15 6 

12 9 

13 7 




3 

13 4 

13.8 

13 1 

13 0 




Sum 

38 8 

43 4 

38 9 

39 1 i 

160 2 

13 4 

Straw -f POi 

1 

12 8 

14 1 

14.2 

12 0 



-f lime 

2 

13 8 

13 2 

13 6 

14.6 




3 

13 7 

15.3 

13 3 

14 0 




Sum 

40 3 

42 6 

41 1 

40 6 

164.6 

13 7 

Sum, 12 pots 


180 1 

185 5 

173 5 

172 6 

711 7 


Yield per pot 


15 0 

15 5 

14.5 

14.4 


i 


From the viewpoint of computation there is only one new feature. One 
must pool the sums of squares in the 16 subclasses. Instead of doing this 
directly it is easier to proceed thus: 


1 . 

2 . 

3 . 


(711.7) 2 

Correction: — 10,552.44 

48 

Total: (21.4) 2 + (21.2) 2 + (20.1) 2 + (12.0) 2 + . . . + (14.0) 2 


Subclasses: 


- 10,552.44 = 367.15 


(62. 7) 2 + (38. 3) 2 + . . . + (40. 6) 2 
3 


- 10,552.44 = 340.87 


4. Within subclasses: 367.15 — 340.87 == 26.28 


You can easily verify the last result by adding the 16 sums of squares 
you get from applying the usual computational method to each of the 
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subclasses. In the first subclass, for example, the sum of squares is 
(21.4) 2 + (21.2) 2 + (20. 1) 2 - (62.7) 2 /3 = 0.98. 

The sum of squares for subclasses is now divided into three parts in 
the manner already familiar: 

5. Humus treatments: 


(230.2)2 + . . . + (164.6) 2 
12 


10,552.44 « 306.24 


6. Chemical treatments: 
(180.1) 2 + . . . + (172.6)2 
12 


- 10,552.44 = 9.17 


7. Subclass discrepance: 340.87 — (306.24 + 9.17) = 25.46 

The results are recorded in table 11.14. As will be explained below, 


TABLE 11.14 

Analysis of Variance of Wheat Yields in Soil Treatment Experiment 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Humus treatments 

3 

306.24 

102.08** 

Chemical treatments 

3 

9.17 

3 06* 

Interaction 

9 

25 46 

2.83** 

Pots treated alike 

32 

26.28 

0 82 

Total 

47 

367.15 



the subclass discrepance is known as treatment interaction. 

The degrees of freedom for pots are worthy of note. Each subclass 
contributes two, making (16) (2) = 32 in all. The mean square, 0.82, 
measures the average variation of yields from pots treated alike and placed 
at random in the experimental space. It evaluates those uncontrolled 
sources of variation — seed, soil, and position — properly assigned to experi- 
mental error. 

The striking feature of this experiment is the discrepance among the 
subclasses. The chemicals applied to one humus treatment produced 
yields out of accord with those from other humus treatments. An out- 
standing illustration lies in the differences between successive chemical 
treatments in the rows “None” and “Straw + PO”: 

None: 1.7, 6.2, 3.1 

Straw + PO A : —4.6, 4.5, —0.2 

It is the failure of such pairs of differences to be alike, you remember, 
that produces discrepance. Its value in this experiment indicates that 
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the effect of a chemical treatment is not always the same, but depends 
upon the humus treatment to which it is applied. The term used to 
describe this phenomenon is interaction : it is the discrepance between 
experimental treatments. Its measurement and test of significance were 
planned in the design of this experiment. In field trials it is generally 
assumed with some evidence that with fairly uniform soil and in trials of 
varieties or treatments not obviously different there is no real interaction. 
The discrepance, therefore, measures only experimental error. In the 
soil treatment experiment, on the contrary, there was no reason to believe 
that the treatments of one set would react alike with those of the other set; 
hence, the provision of 3 pots per combination to furnish an estimate of 
random variation on the greenhouse bench. 

The components of variance in this kind of experimental design are: 
Treatment 1 (chemical) cr 2 + 3ai2 2 + 12<ri 2 

Treatment 2 (humus) a 2 + 3ai 2 2 + 12or 2 2 

Interaction a 2 + 3cri2 2 

Individuals a 2 

Evidently, interaction is tested thus: 



The manner of testing treatments depends upon conditions under 
which the null hypothesis is set up. In this experiment, the hypothesis, 
a i = 0 is to be tested on the assumption that the chemical treatments 
are always to be tried with this same set of humus treatments — this latter 
set constitutes a population, and there will be no changes in it to affect 
the variation of the chemical treatments. Under this condition, s 2 esti- 
mates the real error in evaluating treatment 1, and the appropriate test 
of <7i = 0 is 

^ s 2 + 35] 2 2 + 12.fi 2 3.06 _ 


Similarly, if the hypothesis, 02 = 0, is tested on the assumption of un- 
changing chemical treatments, then F = 102.08/0.82. 

In some experiments to be described later (section 11.13 et seq .), one 
set of treatments, treatments 2 for example, might be considered as a 
random sample of possible such treatments, so that differences among 
treatments 1 would be affected by the particular sample of treatments 2 
which happened to be drawn. Under this condition you will find that the 
estimate of error must be correspondingly increased, a i = 0 being tested 
by 

_ s- + kiSn 2 + ktfx 1 
~ i 2 + 

A glance at the sums for the humus treatments shows a striking con- 
trast between those for the treated pots and that for the untreated. This 
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calls for separation of the sum of squares, 306.24, into two parts; one 
corresponding to variation among the treatment means and the second 
to the difference between all the straw treatments and none. The sum 
of squares for interaction also may be heterogeneous. For subdivision 
of both, it is convenient to condense the sums of the subclasses thus: 



Chemical Treatment 

. 

Humus 

1 

2 

3 

4 

Sum 

Straw 

117.4 

124.5 

118.7 

120.9 

481.5 

None 

62 7 

61.0 

54 8 

51.7 

230 2 

Sum 

180 1 

185.5 

173 5 

172 6 

711.7 

Difference 

54.7 

63.5 

63.9 

69.2 



In steps 3 and 5 of the computation, note the modification due to un- 
equal numbers in the subclasses: 

a c k , < 117 - 4 ) 2 + • • • + ( 120 * 9 ) 2 , ( 62 * 7 ) 2 + ■ • • + (51.7) 2 

3. Subclasses: 

9 3 

-10,552.44 = 333.58 

(481.5) 2 (230. 2) 2 

5. Humus treatment: 1 10,552.44 = 303.63 

36 12 

6. Chemical treatments: as before, 9.17 

7. Interaction: 333.58 - (303.63 + 9.17) - 20.78 

This interaction measures dissimilarities among the differences in the 
last line of the condensed table. The upward trend seems to be due to a 
downward trend in the no-humus sums rather than to changes in the 
treatment sums. 

Table 11.15 contains the completed analysis of variance. It is now 


TABLE 11.15 

Completed Analysis of Variance of the Wheat Yields in Table 11.13 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Pots in subclasses 

32 

26.28 

0.82 

Chemical treatments 

3 

9 17 

3.06* 

Humus treatments: 




Three treatments 

2 

2 61 

1.30 

Treatment vs. check 

1 

303.63 

303.63** 

Interactions: 

Three humus treatments 




with 4 chemical 

6 

4 68 

0.78 

Humus treatment vs. humus 




check, with 4 chemical 

3 

20.78 

6.93** 

Total 

47 

367.15 
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evident that the differences that may be attributed to the population are 
(i) those among the chemical treatment means; (ii) the difference be- 
tween the means of straw treatments and no humus treatment, 

230.2/12 - 481.5/36 = 5.8 grams; 

and (iii) the interaction between chemical treatments and the 2 humus 
treatments, straw and none. 

EXAMPLE 11.29 — A cooperative experiment on the effect of fertilizer resulted 
in the data below. The yields are in decagrams per sample of broadcast oats, the 
samples being 36" X 36" harvested squares, four of them from each 21' X 52' plot. 
Soil, Carrington loam. 


Block 

Check 

N 

P 

JVT 

1 

19 

18 

16 

13 


16 

16 

22 

27 


18 

17 

18 

28 


18 

14 

18 

28 

2 

15 

22 

19 

15 


16 

24 

20 

14 


16 

22 

20 

18 


17 

20 

24 

14 

3 

21 

20 

24 

25 


19 

21 

15 

20 


18 

20 

19 

24 


22 

20 

20 

20 

4 

18 

20 

20 

20 


20 

22 

20 

18 


20 

18 

17 

12 


18 

19 

20 

18 


Fertilizers containing nitrogen and phosphorous were applied both separately and in 
combination — a factorial design. Calculate the following analysis of variance. 


Source of Variation j 

Degrees of 
Freedom 

Mean 

Square 

Variance of 

Plot Mean 

Treatment 

3 

7 6 


Block 

3 

12 7 


Experimental error 

9 

31 78 

7.94 

Sampling error 

48 

7 46 



Note the large coefficient of variation, C = 14.7%. This is due chiefly to field 
variation because the sampling error is only a small part of the experimental error. 
Estimate the variance of a plot mean if the sampling had been increased to 8 quadrats 
per plot. Ans. Doubling the sampling rate would decrease the experimental error by 
only 11.7%. 
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EXAMPLE 11.30 — The pots of table 11.13 might have been placed on the bench 
in 3 blocks with the 16 treatment combinations randomized in each. In that case the 
32 degrees of freedom for pots would have been subdivided in this manner: 

Blocks 2 

Discrepance 30 

Do you think this would have been a better design? 

EXAMPLE 11.31 — Calculate directly the subclass discrepance, 4.67, in the 
3X4 table of sums comprising the straw and chemical treatments but omitting the 
humus check. Follow computing directions 3, 5, 6, and 7. 

EXAMPLE 11.32 — The gains in weight (grams) of six lots of male rats are shown 
in this table: 



High Protein 


Low Protein 

Beef 

Cereal 

Pork 

Beef 

Cereal 

Pork 

73 

98 

94 

90 

107 

49 

102 

74 

79 

76 

95 

82 

118 

56 

96 

90 

97 

73 

104 

111 

98 

64 

80 

86 

81 

95 

102 

86 

98 

81 

107 

88 

102 

51 

74 

97 

100 

82 

108 

72 

74 

106 

87 

77 

91 

90 

67 

70 

117 

86 

120 

95 

89 

61 

111 

92 

105 

78 

58 

82 


Calculate the analysis of variance as follows: 


Source of Variation 

Degrees of Freedom 

Mean Square 

Level of protein 

1 

3,168 3 

Source 

2 

133.2 

Interaction 

2 

589 0 

Error 

54 

214 6 


11.9 — Proportional subclass numbers. For one reason or another the 
numbers of observations in the subclasses of a table like 11.13 may not be 
equal. When through accident, or under the limitations of an experi- 
mental technique, erratic losses of data occur, the addition theorem for 
sums of squares is annulled and the analysis of variance must be modified. 
Appropriate methods are described in following sections. Occasionally, 
however, the subclass numbers are proportional, as in some of the fore- 
going tables, causing no injury to the analysis of variance. 

In table 11.16 the numbers of dressing percentages (each decreased 
by 70%) for the breeds and sexes represent approximately the proportions 
in which the animals were brought in for slaughter at the College Meats 
Laboratory (3). For illustration we have chosen only a small sample from 
the original data. All the pigs represented have weights between 200 and 
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TABLE 11.16 

Dressing Percentages (Less 70%) of 93 Swine Classified by Breed and Sex. 
Live Weights 200-219 Pounds 



Breed 


i 

2 

3 

4 

5 

Number 

Male Female 

Male Female 

Male Female 

Male Female 

Male Female 

1 

13.3 18.2 

10 9 14 3 

13 6 12.9 

11.6 13 8 

10 3 12.8 

2 

12.6 11.3 

3 3 15.3 

13.1 14.4 

13.2 14.4 i 

10 3 8.4 

3 

11.5 14.2 

10 5 11 8 

4 1 I 

12 6 4 9 

10.1. 10.6 

4 

15 4 15 9 

11.6 11 0 

10 8 

15 2 

6.9 13.9 

5 

12.7 12.9 

15 4 10 9 


14.7 

13 2 10 0 

6 

15.7 15.1 

14.4 10 5 

i 

12 4 

11.0 

7 

13.2 

11 6 12 9 



12 2 

8 

15 0 

14.4 12 5 

1 


13.3 

9 

14.3 

7 5 13.0 



12 9 

10 

16.5 

10 8 7 6 



9 9 

11 

15.0 

10.5 12 9 




12 

13.7 

14.5 12.4 




13 

1 

10 9 12.8 


i 


14 


13 0 10.9 




15 


15.9 13 9 




16 


12 8 




17 


14 0 




18 


11.1 




19 


12 1 




20 


14 7 




21 


12 7 




22 


13.1 




23 


10 4 




24 


11 9 




25 


10 7 




26 


14.4 




27 


11 3 




28 


13 0 




29 - 


12 7 




30 


12 6 




SX 

168.9 87 6 

362.7 182 7 

41 6 27 3 

79.7 33 1 

110 1 55 7 


Total: N - 93, SX « 1,149.4, 5A" 2 « 14,785.62 
Breed sums: 1, 256.5; 2, 545.4; 3, 68.9; 4, 112.8; 5, 165.8. 
Sex sums: Male, 763 0; Female, 386.4. 


219 pounds: in this way any difficulties that might inhere in percentages 
are avoided. Let us emphasize the fact that the subclass numbers are 
representative of population proportions. The picture would be distorted 
by equal class sizes. The only modification we have introduced into the 
original data has been to make the numbers proportional; that is, 
both 12:6 = 30:15 =4:2, etc. 

and 12:30:4:6:10 = 6:15:2:3:5 

While proportional subclass numbers do not disturb the analysis of vari- 
ance the computations must be slightly modified because of the unequal 
sized groups (section 10.8). 
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Here is the method: 

1. Correction: C = (SX) 2 /n = (1,149.4)793 « 14,205.60 

2. Total: SX* - C = 14,785.62 - 14,205.60 = 580.02 

, c. t . ( 168 - 9 ) 2 , < 87 - 6 ) 2 , , ( 55 '7) 2 „ 

3. Subclasses: — — 1 f- . . . 4 C = 122.83 

12 6 5 

4. Within subclasses: 580.02 - 122.83 = 457.19 

5 . Brral; <^ + ... + (!^_C=,7.38 


18 


15 


6 . Sra: W + 125^ _ c. 0.52 

62 31 

7. Interaction: 122.83 - (97.38 + 0.52) = 24.93 
The analysis of variance is given in table 11.17. 


TABLE 11.17 

Analysis op Variance of Dressing Percentages 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Breed 

4 

97 38 

24.34** 

Sex 

1 

0 52 

0.52 

Breed-sex interaction 

4 

24 93 

6.23 

Within subclasses 

83 

457 19 

5.51 

Total 

92 

580.02 



EXAMPLE 11.33 — Between 1934 and 1942, the Test of Performance records the 
Iowa Agricultural Experiment Station included 9 litters each with 1 barrow and 2 
sows. The weights at 180 days of age were as follows: 


Litter 

Barrow 

Sows 

Litter 

Barrow 

Sows 

1 

225 

209 

6 

22 £ 

211 



217 


193 

2 

180 

216 

7 

175 

140 



194 



133 

3 

146 

162 

8 

207 

198 



118 



186 

4 

252 

200 

9 

189 

200 



234 



179 

5 

134 

161 






161 
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Calculate the analysis of variance: 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 


Litters 

8 

23,917.9 

2,989.7 


Sex 

1 

405 6 

405.6 


Litter X sex 

8 

2,838.7 

354.8 


Sows of same litter 

9 

2,299.0 

255 4 



Since the sample chosen for illustration represents only a small fraction 
of the original data, conclusions should be only tentative. 

11.10 — Disproportionate subclass numbers. The 2x2 table. 

Neither equality nor proportionality in subclass numbers is always attain- 
able. For example, one can’t control the proportion of the sexes of chicks 
hatched from differently treated batches of eggs. The consequences may 
be serious if the character measured, such as growth rate, is different in 
the sexes. A lot of chicks with a predominance of males will likely gain 
faster than one with extra females even though the treatment of the second 
lot is more favorable to growth. Much the same effect may be observed 
in table 11.18. The weighted mean of the 2 lots injected with hormone B 
is the smaller, whereas the unweighted mean (ignoring subclass numbers) 
is the greater. As we shall learn, neither pair gives an unbiased estimate 
of the effect of B. 


TABLE 11. IB 

Comb Weights (Milligrams) of Lots of Chicks Injected With 2 Sex Hormones 
Separately and in Combination 



Untreated 


Hormone B 



Number AA" x 

Number SX 

Z 

Untreated 

3 240 80 

12 

1,200 

672 

100 

Hormone A 

12 1,440 120 

6 

112 


15 1,680 

18 

1,872 


Weighted mean 
Unweighted mean 

1,680/15 = 112 

(80 + 120)/2 = 100 


104 

106 


Preliminary Analysis of Variance, Ignoring Disproportion Among Subclass Numbers 


Source of Variation j 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Hormone A 

1 

3,724 


Hormone B 

1 

524 


Individuals 

29 

23,519 

811 



Section 11.10 Analysis of Variance: Multiple Classification 


285 


Another startling characteristic of disproportionality in a 2-way table 
is the failure of the addition theorem for sums of squares. While this is 
not evident in table 11.18, it shows up in the analysis of the rat data of 
table 11.22, the two sums of squares, sex, and generation, adding to more 
than the total sum of squares. Clearly, no proper estimate of interaction 
is given in such a table. In fact, all estimates and tests of significance may 
be biased by the disproportion of subclass numbers, and the appropriate 
statistical methods are thereby complicated (27). Missing data in ran- 
domized blocks and latin square experiments constituted special cases of 
the more general one now to be explained. 

In the problems of this section and the next two, there is required a 
preliminary analysis of variance in which the disproportion of subclass 
numbers is ignored. The analysis follows the pattern for proportional 
numbers in the preceding section. We are to learn how unbiased esti- 
mates and tests of significance may be attained for the main effects (means 
of rows and columns) and interactions. But it should be footed that there 
is no bias in the sums of squares for subclasses, either among or within 
them. In table 11.18, for example, the sums of squares among subclasses, 


(240) 2 (1,440) 2 (1,200) 2 (672) 2 

~T” + ~i2~ + ~“7T~ + 


(3,552) 2 

33 


4,940 


and for individuals, 23,519, are correctly estimated in the manner of 
section 10.8. The terms “unequal subclass numbers” and “dispropor- 
tionate subclass numbers” are not always clearly distinguished. The 
latter is often referred to as the nonorthogonal case: it occurs only in tables 
of multiple classification (more than one criterion). 


A. INTERACTION NEGLIGIBLE 

It will appear immediately that the presence or absence of interaction 
in the population determines the appropriate statistical method. Knowl- 
edge about interaction may come from theory or experience, or only from 
examination of the data. In the latter situation, it is desirable to have a 
convenient test, and this is available in R X 2 tables (2 or more rows, 
2 columns). The sum of squares for interaction is calculated directly 
from the subclass numbers and means (table 11.19), and this is tested for 
significance against the variance of individuals (table 11.18). 

The calculations of table 11.19 require little explanation. Weights, 
W , are made up from the numbers of chicks, k , and these lead to a 
weighted sum of squares (1,216) of the mean differences. A correction 
term yields the interaction mean square which is little greater than 
sampling expectation. Therefore, the interaction in the population may 
be assumed negligible. 

It turns out that the correction term, ( SWDY/SW , is an unbiased 
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TABLE 11.19 

Interaction Negligible in a 2 X 2 Table With Disproportionate Subclass 
Numbers. Computations Following Table 11.18. Chick Hormone Data 



Untreated 

Hormone B 

k\k t 

S2 ~ *1 

a ss D 

WD 

WD * 

k\ -jr k% 

sss W 

At 


h 

*2 

Untreated 1 

3 

80 

12 

100 

2 4 

20 

48 

960 

Hormone A 

12 

120 

6 

112 

4.0 

-8 

-32 

256 






6.4 


16 

1,216 


Sum of squares for interaction 

- SWD* - (SWDY/SW - 1,216 - (16) a /6.4 - 1,216 - 40 - 1,176 
F = 1,176/811 =* 1.45, d.f. « 1 and 29, not significant 


Correction for disproportion 

■■(sum of squares for hormone B) — (SWD) % /SW *= 524 — 40 ■* 484 
(see preliminary analysis, table 11.18) 


Completed Analysis of Variance 


Source of Variation 

Degrees of Freedom 

Mean Square 

Hormone A 

1 (3,724 

- 484) 3,240 

Hormone B 

1 

40 

Interaction 

1 

1,176 

Individuals 

29 

811 


For A f F - 3,240/811 = 4.00, For B, F — 40/811 = 0.05 


estimate of the mean square between the column means (hormone B ); 
hence, the preliminary value in table 11.18 must be reduced by a 

Correction for disproportion = 524 — 40 = 484 

The same correction for hormone A results in the completed analysis at 
the bottom of the table. 

There is now available an unbiased estimate of the mean difference 
due to hormone B acting both with and without A: this weighted mean 
difference is: 

SWD/SW = 16/6.4 = 2.5 mg. 

It is notable that this is not the difference between either the weighted 
or unweighted means of table 11.18. A similar estimate for the difference 
due to A may be calculated from the columns. In this table the weights 
happen to be the same as those in the rows so that 

SWD/SW = [(2.4) (40) + (4.0) (1 2)]/6.4 = 22.5 mg. 
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It is the significance of these weighted differences that is tested by the 
F-values in the table. 


B, INTERACTION PRESENT 
The Method of Unweighted Means (27) 

This case is illustrated by table 11.20. From the preliminary analysis 


TABLE 11.20 

Interaction Present. Computations for 2X2 Table With Disproportionate 
Subclass Numbers. The Data Are Days to Death of 2 Strains of Mice 
Inoculated With 2 Typhoid Organisms (10) 


Organism 

Strain Z 

Strain Ba 

k\k t 

£ i - £i 

** D 

WD 

WD* 

k\ -f- ki 
= W 

ki 

£i 

kt 

X2 

9 D 

31 

4 03 

33 

3 76 

15.98 

0 27 

4 31 

1.16 

DSC 1 

133 

7.80 

188 

4.13 

77 89 

3.67 

285 86 

1,049.11 






93 87 


290.17 

1,050 27 


Preliminary Analysis of Variance from the Original Data 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Subclasses 

3 

1,214 76 


Individuals 

381 

1,387.49 

3.642 


Interaction mean square 

- SWD* - {SWDY/SW - 1,050.27 - (290.17) 2 /93.87 « 153.3 
F - 153.3/3 642 = 42.08, dj. = 1 and 381 


Harmonic mean of subclass numbers, 


1 _ 

k 0 


1/1.1. L 

4^31 ^ 33 ^ 1 33 ^ 188/ 


= 0.01885; k Q = 53.05 


of variance and the interaction mean square it is evident that interaction 
is present in the population — strain Z 1S relatively more resistant to DSC 1 
than is Ba. The presence of interaction dictates the unweighted means as 
unbiased estimates of the strain, organism, and interaction effects in 
this 2X2 table. 

Having discovered that the two typhoid organisms affect the strains of 
mice differentially, the investigator may not wish to go further. He will 
find that the means, 4.03, 3.76, and 4.13, differ only by the average 
amount expected in sampling from a single population (F = 3.88/3.64). 
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The outstanding result is the resistance of strain Z t0 &SC 1. It may be 
uninformative to average the rows or the columns. However, there are 
circumstances in which comparisons between the row means (and between 
the column means) are required. Hence, we proceed with the estimates 
and tests summarized in table 11.21. 


TABLE 11.21 

Mean Days to Death in 4 Groups of Mice, Copied From Table 1 1.20. 
Calculations With Interaction Present 



Z 

Ba 

Sum 

Mean 

9 D 

4 03 

3.76 

7 79 

3 90 

DSC 1 

7 80 

4 13 

11.93 

5.97 

Sum 

11 83 

7.89 

19 72 


Mean 

5 92 

3.94 


4.93 


Mean squares: 

Organisms, 

(11.93 - 7.79)* _ 1 „ g 

4 



Strains, 

(11.83 - 7.89)* _ 3 g g 

4 



Completed Analysis of Variance 


Source of Variation 

Degrees of Freedom Mean Square 

Organisms 

Strains 

Interaction 

Individuals 

1 (4.28) (53.05) 

1 (3.88) (53.05) 

1 

381 

227 1** 

205 8** 

153 3** 

3.642 


If you have forgotten the easy way to calculate mean squares with a 
single degree of freedom, review section 10.5. Since these variances are 
based on means, they must be multiplied by some k to be comparable 
with those already computed; but because k is different for every cell of 
the table an average must be found. The appropriate one is the harmonic 
mean, k 0 = 53.05 (table 11,20). The products of k 0 by the mean squares 
first calculated complete the analysis of variance given in the table. 

If one has ample reason to believe that interaction exists in the popula- 
tion, the major part of table 11.20 may be omitted -it was included to 
justify that hypothesis. If interaction is assumed to be present, its mean 
square may be calculated directly from the unweighted means: 

[(7.80 + 3.76) - (4.03 + 4.13)? . 

4 

Then (2.89) (53.05) = 153.3, as before. 
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The three effects, strains, organisms and interaction, are all tested in 
the manner of section 1 1 .8. 

EXAMPLE 11.34 — A good way to learn the effects of disproportionality and at the 
same time to gain experience in calculation is to complete some analysis incorrectly. 
For example, complete the analysis of the chick hormone data in table 11.19 on the 
assumption that interaction is not negligible in the population. Ans. Mean difference 
for A t 26; for B , 6 milligrams: mean square for A, 4,056; for B , 216. 


EXAMPLE 11.35 — Complete the analysis in table 11.20 on the assumption that 
interaction is negligible. Ans. Difference between organism means, 1.98; between 
strain means, 3.09 days: mean square for organisms, 896.97; for strains, 207.16. 


11.11 — Disproportionate subclass numbers. The R x 2 table. This 
type of table is of rather common occurrence, ordinarily involving the 
two sexes. The computations, starting in the same way as those of the 
preceding section, diverge according to the presence or absence of inter- 
action. 


A. INTERACTION NEGLIGIBLE 

The data in table 11.22 first brought to our attention some of the 
peculiarities of disproportionate subclass numbers. In the preliminary 

TABLE 11.22 

Interaction Negligible in R X 2 Table With Disproportionate Subclass Numbers. 
Number and Mean Gain in Weight (Grams) of 149 Wistar Rats During 1928-29. 
Four Successive Generations. Gains During 6 Weeks Beginning at 28 Days of 
Age (4). 100 Grams Subtracted From Each Gain 


Generation 


Male 

Female 

kikt 

£i - xt 

- D 

WD 

k\ -f" kz 
- W 

ki 

$ i 

kz 

±2 

1 

21 

76 952 

27 

9 518 

11 8125 

67.434 

796 564 

2 

15 

61 467 

25 

14.080 

9 3750 

47.387 

444.253 

3 

12 

55 667 

23 

8.522 

7.8857 

47.145 

371 771 

4 

7 

71.000 

19 

6 790 

5 1154 

64 210 

328 460 






34 1886 


1,941 048 


Preliminary Analysis of Variance of Original Data 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Subclass means 

7 

119,141 


Sexes 

1 

114,287 


Generations 

3 

5,756 


Individuals 

141 

409 


( Continued next page) 
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Interaction sum of squares: *SWD* - (SWDY/SW - 113,385 - 110,202 - 3,183 
Correction for disproportion * 114,287 — 110,202 » 4,085 
Completed Analysis 


Sexes 

1 

110,202 

110,202 

Generations 

3 

(5,756-4,085) 1,671 

557 

Interaction 

3 

3,183 

1,061 

Individuals 

141 

409 


* SWD a is run up on the machine: (67.434) (796.564) -f* • • • 


analysis of variance the two sums of squares, for sex and generation, add 
to more than the total for subclasses, emphasizing the nonorthogonal 
feature introduced by irregularities in the cell frequencies. 

The interaction, though large, falls short of significance. Since there 
is no reason to believe that the difference in the sex rates of gain would 
change with generation, so far as the population is concerned, negligible 
interaction may be assumed. 

The remaining computations proceed as in table 11.19. 

This method is applicable to tables in which the data are missing en- 
tirely from a subclass. Zero may be entered instead of k and x with no 
disturbance of the computations. From the degrees of freedom for inter- 
action deduct 1 for each cell with no entry. 

The unbiased estimate of the sex difference in mean gain is 

SWD/SW = 1,941.048/34.1886 = 56.77 grams 

That is the mean difference tested by F = 110,202/409. 

Unbiased estimates of generation mean differences will be made avail- 
able by the method of fitting constants to be explained in the next section. 
Very good approximations can be got by applying section 11.10-^4, 
table 11.19. As an example, the weighted mean difference between genera- 
tions 1 and 2 is calculated in table 11.23, and a test of significance is 
provided. 

TABLE 11.23 

Approximate Method of Calculating the Weighted Mean Difference Between 
Generations 1 and 2. For Exact Method, See Section 11.12-A 



Generation 1 

*1 

Generation 2 

k 2 £ t 

W 

D 

WD 

Male 

21 

76 952 

15 

61 467 

8 75 

15 485 

135 49 

Female 

27 

9.518 

25 

14 080 

12.98 

-4.562 

-59.21 






21.73 


76 28 


Weighted mean difference = SWD/SW — 3.51 gTams 
Variance of mean difference «* 409/21.73 — 18.82 sj « 4.34 
/ « 3.51/4.34 - 0.81, d.f. * 141 
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B. INTERACTION PRESENT 
The Method of Weighted Squares of Means 

For illustration we take the data in table 11.24 (21). The soils in the 
county were rated according to quality from high (I) to low (IV), and 
the farms classified according to rating and tenure. Among the tenants 
there seems to be a tendency to devote all suitable land to the production 
of corn. 

The preliminary calculations proceed as before. While the interaction 
is not significant, it followed the same pattern in succeeding years and is 


TABLE 11.24 

Acres in Corn Among Owners and Tenants, Audubon County, Iowa, on 4 Soil 

Groups, 1934 


' 

Soil Group 

Owners 

Tenants 


ki 

Si 

hi 

T2 

W 

D 

WD 

I 

36 

32.7 

67 

55 2 

23 418 

22.5 

526 90 

II 

31 

36 0 

60 

53.4 

20.440 

17 4 

355.66 

III 

58 

30 1 

87 

46 8 

34 800 

16 7 

581.16 

IV 

43 

41 6 

68 

43.5 

26.342 

1.9 

50 05 






105 000 


1,513 77 


From preliminary analysis, mean square for farms *=830 

degrees of freedom = 442 


Sum of squares for interaction = SWD 2 — (SWD) % /SW ** 6,020.4 
Mean square = 6,020.4/3 = 2,006.8 

F - 2,006.8/830 = 2.42, d.f. - 3 and 442, Fob « 2.62 


probably a population characteristic. We shall proceed on that assump- 
tion, computing the analysis of variance in table 11.25. Yates called 
this the method of weighted squares of means (27). It duplicates the weights 
in table 11.24, though a different method of calculation is convenient 
to get the weights for the columns: here, the reciprocals are put down 
individually, then added — W is the reciprocal of the sum. The weights 
are now applied to the squares of the unweighted means {WD 1 ) in order 
to reach unbiased sums of squares. 

The factors, 4 and 16, the squares of the numbers of columns and of 
rows, are required because means instead of sums are used in the compu- 
tations. 

If one is sure there is interaction in the population, the preliminary 
computations of table 11.24 may be combined with those of 11.25 in a 
single table. 
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TABLE 11.25 

Computation of Analysis of Variance in R X 2 Table With Interaction Present. 
Weighted Squares of Means. Data From Table 11.24 


Soil 

Group 


Owners 

Tenants 

Sum of 
Reciprocals 

Weights 

W 

Un- 

weighted 

Means 

2 

W2 


k i 

36 

67 | 





I 

i/k 

0.027778 

0 014925 

0.042703 

23.418 




2 

32.7 

55.2 



43.95 1 

1,029.22 

1 

k 

31 

60 





II 

\/k 

0 032258 

0.016667 

0.048925 

20 439 




2 

36.0 

53.4 



44.70 

913,62 


k 

58 

87 





in 

i/k 

0.017241 

0.011494 

0 028735 

34.801 




X 

30.1 

46 8 



38.45 

1,338 10 


k 

43 

68 





IV 

1 /k 

0 023256 

0 014706 

0 037962 

26 342 




2 

41.6 

43.5 



42 55 

1,120.85 






105 000 


4,401.79 

Sum of recip- 







rocals 

0. 100533 + 0 057792 — 0.158325 = sum of column, also. 


Weights, W 

9.9470 

+ 17.3034 

- 27 2504 



Unweighted 







means, x 

1 35 10 

49 72 



42.41 


m 


349 14 +860.33 



** 

1,209.47 


Weighted sums of squares: 

Soils: 4[SWx*-(SW?y/SW]= 4[185,215.15 — (4,401.79) a /105.000] ■» 2,736.8 
Tenancy: 16&SW- (SWx)VS\V\ « 16[ 55,030 42 - (1,209.47)V27.2504] -21,597.3 


Completed Analysis of Variance 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 


Soil groups 

3 

2,736 8 

912 


Tenancy 

1 

21,597.3 

21,597 


Interaction 

3 

6,020 4 

2,007 


Farms 

442 


830 



On the assumption that interaction is present, the unweighted differ- 
ences and means in the rows and columns are unbiased estimates. The 
weighted squares are used to calculate variance only. 
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EXAMPLE 1 1.36 — Becker and Hall (2) determined the number of oocysts pro- 
duced by rats of 5 strains during immunization with Eimeria miyairii. The unit of 
measurement is 10® oocysts. 


Sex 

i 

Strain 

Lambert 

Lo 

Hi 

W.E.L. 

Wistar (A) 

Male 

k 

8 

14 

20 

8 

9 


£ 

36.1 

94.9 

194 4 

64.1 

175.7 

Female 

k 

7 

14 

21 

10 

8 


£ 

31 9 

68.6 

187.3 

89 2 

148.4 


Verify the completed analysis of variance quoted from the original article: 


Sex 

1 

2,594 6 

2,594.6 

104,391.4 

Strain 

4 

417,565 6 

Interaction 

4 

8,805 3 

2,201.3 

Error 

109 

332,962.9 

3,054.7 


You will not be able to duplicate these numbers exactly because the means are reported 
to only 3 significant digits. Your results should approximate the first 3 figures in the 
mean squares, and no more are necessary for testing. 


EXAMPLE 11.37 — Another set of mice from Gowen’s investigations (10) yielded 
the following data on days to death : 


Strain 


Organism 

9 D 

11C 

DSC 1 

z 

k 

172 

70 

24 


X 

4.42 

6 97 

8 38 

Ba 

k 

182 

110 

33 



3 63 

4 43 

4 45 


The variance of the individual days to death was 3.54. Compute the mean square 
for interaction, 84.9, and complete the analysis by the appropriate method of weighted 
squares. An*. Mean square for organisms, 247; for strains, 494. 


11.12 — Disproportionate subclass numbers. TheRXCtable. Now 

we shall give the general methods of which those in the last two sections 
are special cases. Ordinarily the foregoing easier methods are adequate, 
but the more extensive tables appear often enough to call for the appro- 
priate treatment. 

The easy method of testing interaction is not available for tables with 
more than 2 rows and columns. This requires shrewder appraisal of the 
situation, because much time can be lost in calculation only to learn that 
the wrong method has been chosen. Examine the succession of differ- 
ences in the several rows or columns: if they conform to a pattern, inter- 
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action is likely negligible. The size of the error mean square, together 
with the degrees of freedom available, helps in forming judgments. 

In case of doubt, the method of unweighted means, section 11.10-/?, 
(not hard to compute) may furnish the clue: this is especially true if 
the subclass numbers are not very different. While this method of un- 
weighted means is not exact in the general table, though it was exact in 
the earlier section, it may convince you that no further analysis is necessary. 

An illustration is found in table 11.26, abstracted as a representative 
sample from more extensive tabulations (6) . The object of the investiga- 
tion was to learn if season affected these gains. As explained before, the 
subclass numbers are ignored, the method of calculation being that of 
section 11.2 applied to the mean gains. All calculations are indicated in 
the table. 

The completed analysis shows that the variation in weight-gains is only 
such as may be expected in sampling from a single normal population. 
The weighted mean of all the gains, 33.8 pounds per month, is the appro- 
priate estimate in this undifferentiated sample, and s = \/29S = 17.2 
pounds per month measures the variation. 


TABLE 11.26 

Monthly Mean Gains in Weight (Pounds) of Holstein Heifers at Three Ages 


Month of 
Gain 

Age in Months 

13 

18 | 

24 

k £ 

k £ 

k £ 

August 

September 

October 

November 

5 22 2 

8 24 4 

4 45 0 

4 18 0 

1 21 4 

3 30 3 

3 30 3 

3 67.0 

3 40 7 

1 39 0 

3 48 3 

6 49 0 


Mean square of individuals within subclasses = 931.8, 
d.f . = 38 (from original data) 


1 ^ 

K 


1 ( 1+1 + ...+1 + A 

12 \5 8 3 ^ 6 ^ 


0.3168 


Error mean square for comparison with mean squares calculated 
from means — (931 .8) (0.31 68) =* 295 


Completed Analysis of Variance of Means 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Months 

3 

561 

187 

Ages 

2 

573 

287 

Interaction 

6 

1,185 

198 

Error 

38 


295 
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If some of the F-ratios had been near the levels of significance, one 
of the exact methods, A or B to follow, would be available: the approxi- 
mate method would presumably afford an accurate enough test of inter- 
action to warrant decision as to whether A or B should be used. 

If the subclass numbers are quite different, especially if they seem to be 
characteristic of the population, another exploratory method may be 
available — the method of proportional subclass numbers (18) (19). The 
method is illustrated in table 11.27. In section 9.9 the observed subclass 


TABLE 11.27 

Farm Acres in Corn Classified by Tenure and Soil Productivity. 
Audubon County, Iowa (21) 




Owner 

Renter 

Mixed 



Soil 


Ob- Ex- 

Ob- Ex- 

i 

X 

A 

O 



Class 

\ 

served pected 

served pected 

served pected 

Sk 

SX 


k 

36 36 75 

67 62 92 

49 52.33 

152 


I 

X 

32 7 

55.2 

50 6 




SX 

1,202 

3,473 

2,648 


7,323 


k 

31 33 85 

60 57 95 

49 48.20 

140 | 


II 

X 

36 0 

53 4 

47 1 




SX 

1,219 

3,095 

2,270 

I 

1 

6,584 


k 

58 54 40 

87 93 13 

80 77 47 

225 


III 

$ 

30 1 

46.8 

40 1 




sx 

1,637 

4,358 

3,107 


9,102 

Sk 


125 

214 

178 

517 


sx 


4,058 

10,926 

8,025 


23,009 


Analysis of Variance Using Expected Sums and Numbers 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Soils 

2 

6,635 

3,318 

Tenure 

2 

27,367 

13,684 

Interaction 

4 

883 

221 

Error (from original data) 

507 


830 


numbers were shown to be independent, so that the proportional numbers 
may be taken to represent the distribution in the population. The product 
of the proportional number by the subclass mean gives a value of SX, and 
these two new statistics are used to calculate the analysis of variance in 
the manner of section 11.9. 

There is no evidence of interaction in the population, while the tenure 
means are doubtless different. If it seems necessary to make a more critical 
test of the soil effects, the method A below is appropriate. 
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A. INTERACTION NEGLIGIBLE 
The Method of Fitting Constants 

This method leads to a test of the significance of interaction. If inter- 
action turns out to be negligible, the accompanying estimates and other 
tests of significance are unbiased, and the analysis is complete. But if 
interaction becomes evident, the main effects and their tests are only 


TABLE 11.28 

The Method of Fitting Constants. Days to Death in 3 Strains of Mice Inocu- 
lated With 3 Isolations of the Typhoid Bacillus 


Organ- 
ism j 


Strain of Mice 







RI 

z 

Ba 

Sk 

SX 



b + £ 

' 

9 D 

k 

34 

31 

33 

98 

385 



3 9490 


k/Sk 

0.34694 

0.31633 

0.33673 



3 

9286 


11 C 

k 

66 

78 

113 

257 

1,442 



5.8600 


k/Sk 

0 25681 

0.30350 

0 43969 



5 

6109 


DSC 1 

k 

107 

133 

188 

428 

2,523 



6 1375 


k/Sk 

0 25000 

0 31075 

0 43925 

I 


5 

8949 


Sk 

SX 

207 

1,271 

242 

1,692 

334 

1,387 

783 

4,350 




a 

0.4506 

1.2240 

-1 6746 




x * 

5 3155 


Preliminary Analysis of Variance 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Subclasses 

8 

1,785 58 


Organisms 

2 

309 47 L rr 

1,227 19/ 1,536 66 


Strains 

2 


Individuals 

774 

5.015 


an f (34) (0.34694) 4 ... 4 (107)(0.25000) - 207]*! 
4- [ (34) (0.31 633) 4 ... 4 (107)(0.31075]a 2 
4- [(34) (0.33673) 4- ... 4- (107)(0.43925]a, 

- (34) (3.9286) 4 ... 4 (107)(5 8949) - 1,271 

(and so on to the last a) 

a,: [(33) (0.34694) 4 ... 4 (188) (0.2500)]*! 

4 [(33) (0.31 633) 4* ... 4- (188)(0.31075)]* 2 
4 [(33) (0.33673) 4 ... 4 (188)(0.43925) - 334]* a 
® (33)(3.9286) 4 . . . (188)(5 8949) - 1,387 
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approximate: then the method following this may be applied. It is de- 
sirable, therefore, to anticipate interaction if possible — from theoretical or 
experiential considerations, from examination of the data or from the 
foregoing analysis of unweighted means. Of course, the discovery of the 
presence or absence of interaction may be the objective of the experiment, 
so that this method of fitting constants is final. On the other hand, inter- 
action may be obvious, in which case, go directly to the following method 
of weighted squares for estimates and tests of the main effects. 

More of Gowen’s data (10) are used for illustration, table 11.28. There 
was strong evidence of interaction in tables 11.20 and 11.21, and it is 
anticipated here: we shall apply both methods for the sake of contrast. 

The table is made up from the original data containing the subclass 
numbers and sums of days to death. If the R X C table is rectangular in- 
stead of square, arrange it so that R is large and C small — the fewer the 
columns the easier the calculations. Each row of ratios should add to 
unity. The means, x R > are weighted means, SX/Sk. 

The 7 constants to be fitted by regression methods are an overall mean, 
x , deviations in each of the columns, a , and in each of the rows, b. These 
deviations are 

a — x c — x and b = x R — x, 

which could have been calculated in section 11.3 and used to get the 
regression values, Jt: it can be shown quite easily that 

X = a + b + x 

After the constants are computed, they will be used to make a table of 
regression values with zero interaction like 11.4. Note that, since the 
a ? s and b’s are deviations from x, therefore Sa = Sb = 0. 

Corresponding to each column an ^-equation is written as outlined 
at the bottom of the table. In a\ the subclass numbers in column 1 are 
used over and over, each time with successive columns of ratios; and in 
the right member, with the column of means. In a 2 the numbers in column 
2 are applied to the same pattern of ratios and means. The only fault in 
the pattern is the subtraction of Sk from one of the terms in each equation. 

The coefficients of the a" s, easily calculated with a machine, are set 
down in the following set of simultaneous linear equations: 

— 151.505 + 64.036(22 + 87.468 a 9 = -136.35 

64.036 a 1 - 167.191 a 2 + 103.155 a 8 = -348.54 

87.468 &\ + 103.155 a 2 - 190.624 a 3 = 484.92 

In verification of the computations, observe that the coefficients occur 
in pairs symmetrically arranged with respect to the principal diagonal, 
and that the sum of the coefficients in each of the 3 columns, as well as the 
sum of the right hand members, is zero (except for rounding errors). 

Since Sa = 0, the first step is to substitute a$ = -~ai — a 2 in each 
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equation. This is easily done: simply subtract 87.468 from each of the 
preceding coefficients in the first equation, and 103.155 from each in the 
second, the results being 

— 238.973 a\ - 23.432 a 2 = -136.35 

- 39.119 ai -270.346 ^2 = -348.54 

The third equation is merely the sum of these two and may be discarded. 
The solution is 

= 0.4506 

02 = 1 .2240 

and, therefore, 0 3 = —1.6746 

Verify by substitution in one or more of the original equations. It is 
convenient to enter the 0 ’s in the last row of the table. 

A quantity, b + in each row is now calculated from the 0 ’s together 
with £ r and the ratios in the same row. As an example, in row 1, 

bx + & = 3.9286 - (0.34694) (0.4506) - ... - (0.33673)(- 1.6746) 

= 3.9490 

Similarly, b 2 + x = 5.8600 

b z + x = 6.1375 


3x — 15 9465 (because Sb = 0) 
x= 5.3155 
h = -1.3665 
b 2 = 0.5445 

b s - 0 8220 

The “reduction in sum of squares due to fitting constants” is now calcu- 
lated from the constants and the original SX in the borders of the table 
by this process: 

(0.4506) (1,271) + (1.2240) (1,692)+ . . . + (6. 1 375) (2,523) - (4,350) 2 /783 

= 1,609.78 

Deducting this from the sum of squares for subclasses, the interaction sum 
of squares is attained, 

1,785.58 - 1,609.78 = 175.80 

Since there are 4 degrees of freedom for interaction, we have 

Mean square for interaction = 175.80/4 = 43.95 
F = 43.95/5.015 - 8.76, d.f. = 4 and 774, F M = 3.35 

With rejection of the hypothesis of zero interaction, we are warned 
that further computations by this method yield somewhat biased esti- 
mates. The mean square for interaction is correctly estimated and tested, 
but we must turn to the method of weighted squares, following this, to 
get further information. However, as illustration of the remaining pro- 
cedure if interaction were negligible, we complete the calculations. 


Adding, 

Finally, 
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With the constants now fitted, the regression values, It = a + b + 
are calculated in table 11.29. If there were no interaction, the means for 

TABLE 11.29 

Regression Values From Fitted Constants. Interaction Assumed Negligible. 
Mouse Data of Table 11.27 




1 RI 


Ba 


Organism 

Constants 

\ ai =0.4506 

as = 1.2240 

03 1.6746 

Mean 

9 D 

11 C 

DSC 1 

h = -1.3665 
b* » 0 5445 
h - 0 8220 

4 3996 
6.3106 
6.5881 

5 1730 
7.0840 
7.3615 

2.2744 

4.1854 

4 4629 

3 9490 

5 8600 
6.1375 

Mean 


5 7661 

6.5395 

3.6409 



Completed Analysis 


Source of Variation 

Degrees of 
Freedom 

Sum of Squares 

Mean 

Square 

Organisms 

2 

309.47 — ( — 73.12) = 382 59 

191 30 

Strains 

2 

1,227.19 — (—73 12) “1 , 300.31 

650.16 

Interaction 

4 

175 80 

43.95 

Mice 

774 


5 015 


strains and organisms would be the correct estimates of days to death. 
Also, the analysis of variance would be completed by a correction to be 
deducted from the sums of squares for organism and strain in the pre- 
liminary analysis: 

Correction = (total for organism and strain) — (reduction due to constants) 
- 1,536.66 - 1,609.78 = -73.12 

The completed analysis is entered at the bottom of the table. 

The nature of the reduction in sum of squares due to fitting constants 
will be clear if you go through this process: calculate the sum of squares 
for the 9 subclass means in table 11.29, having no interaction, and com- 
pare it with the corresponding sum in table 11.28. The difference is the 
interaction sum of squares, but rounding errors have crept into the fourth 
significant figure. 

B. INTERACTION PRESENT 
The Method of Weighted Squares of Means 


This method gives unbiased estimates and tests of the main effects if 
interaction is present. It furnishes no information about interaction in 
the general R X C table, though it did give this information in the special 
cases where one or both of R and C were 2. In the 2X2 table, the 
methods of unweighted means and of weighted squares are equivalent. 
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Since we now have evidence of what was anticipated about the mouse 
data, that interaction is present, we turn to the present method to com- 
plete the examination correctly. Table 11.30 is a simple extension of 
table 11.25, and requires no further explanation. 

Before embarking on this computation, ask yourself this question: 
since the strains do not respond in the same way to the organisms, is any- 
thing to be gained by estimates and tests of the overall averages? On the 


TABLE 11.30 

Computations in R X C Table With Disproportionate Subclass Numbers. 
Interaction Present 

Weighted squares of means. Mouse data from table 11.28. 


Organ- 

ism 


Strain 

Sum of 
Recipro- 
cals 

Weight 

W 

Un- 

weighted 

Means 

* 

W£ 

RI 


Ba 


k 

34 

31 

33 





9 D 

\/k 

0 0294 

* 0 0322 

0 0303 

0 0919 

10 873 




£ 

4.0000 

4 0323 

3.7576 



3.9300 

42 731 


k 

66 

78 

113 





11C 

1 Ik 

0.0152 

0.0128 

0 0088 

0.0368 

27 158 




£ 

6 4545 

6 7821 

4.3097 



5.8488 

158 842 


k 

107 

133 

188 





DSC 1 

1 /k 

0 0093 

0 0075 

0 0053 

0 0221 

45 078 




£ 

6.6262 

7 8045 

4 1277 



6 1861 

278 857 







83 109 ; 


480 430 

Sum of 1 fk 

1 0.0539-f 0 0525 + 

0 0444 

- 0 1508 

= sum of column, also. 

Weights, W 

18.550 

4- 19 012 + 22 486 

= 

60 048 



Unw’ted x 

5 6936 

6 2063 

4.0650 





m 


105.616 

4-117 994 4- 

91.406 




315 016 


Weighted sums of squares: 

Organisms, 9 [SW& - (. SWxY/SW ] = 403 11 
Strains, do. = 473 40 


Completed Analysis of Variance 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Organisms 

2 

403 11 

201 6 

Strains 

2 

473.40 

236.7 

Interaction (table 11.28) 

4 


43.95 

Mice 

774 


5.015 


* The computations were made with 6 decimal places so rounding discrepancies 
will be found if only 4 are used. 



Section 11.15 Analysis of Variance: Multiple Classification 


SOI 


assumption that the border means do represent some specifiable popula- 
tions, we proceed with the illustration. Since interaction is present, we 
take the unweighted means as estimates, and arrive at a test of the signifi- 
cance of differences among them. 

It is worth while to observe the effect of interaction by comparing 
table 11.30, in which it was properly taken into account, with 11.29 
where it was ignored. 

In closing these sections on disproportionate subclass numbers, it 
should be observed that use of the appropriate method of analysis ordi- 
narily makes the efficiency of such experiments (that is, the amount of 
information per individual) close to 100%. Attention has been drawn 
to contrasts between the results of appropriate and inappropriate methods 
applied to the same data: clearly, much information can be lost by use of 
the wrong method. Other things being equal, data should be collected 
in equal or proportional numbers because the information per individual 
is maximized and the computations are easier. No statistical method can 
compensate for the loss of data — it is only the remaining information that 
can be extracted, and that at some loss of efficiency. 

11.13 — Experiments repeated in several places or years. Agricul- 
tural experiments are often repeated in various localities and in successive 
seasons in order to learn about various environmental effects on the varie- 
ties or treatments being studied. Each experiment is usually of some 
replicated design from which the local responses may be evaluated in the 
manner already described. The object of the repetition is to determine if 
one can make recommendations which are widely applicable. 

Such groups of experiments pose a variety of problems for whose 
solution there is no simple set of rules. The difficulties are discussed by 
Yates and Cochran (30) who illustrated methods suitable for particular 
sets of data. 

Some of the principles arc applied to the data in table 11.31, selected 
from the returns from the 1943 cooperative seed treatment tests on Mukden 
soybeans (16). Four treated lots of 100 seeds, and 1 untreated, were 
planted in 5 randomized blocks at each participating station. The total 
numbers of emerging plants (from 500 seeds) are shown for 5 locations. 

From these data, supplemented by the local analyses of variance at the 
bottom of the table, it is evident that conditions affecting emergence 
varied greatly from place to place. Only in some of the stations were 
the treatments effective. Even the error mean squares differed notably, 
a circumstance not unusual in this kind of investigation. Our problem is 
to discover any general features of the data that may be useful. 

The first thing to learn is whether the mean squares for experimental 
error may be estimates of some common variance: if so, they can be com- 
bined into a single more reliable estimate. The method is that of section 
10.13 with n = 5 and k — 1 = 16. Corrected chi-square turns out to 
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TABLE 11.31 

Number of Emerging Plants (500 Seeds) in 5 Plots. Cooperative Seed 
Treatment Trials With Mukden Soybeans. 1943 


Location 

Untreated 

Arasan 

Spergon 

Semesan, Jr. 

Fermate 

Total 

Michigan 

360 

356 

362 

350 

373 

1,801 

Minnesota 

302 

354 

349 

332 

332 

1,669 

Wisconsin 

408 

407 

391 

391 

409 

2,006 

Virginia 

244 

267 

293 

235 

278 

1,317 

Rhode Island 

373 

387 

406 

394 

375 

1,935 

Total 

1 

1,687 

1,771 

1,801 

1,702 

1,767 

8,728 


Mean Squares From Original Analyses of Variance 


Source of 
Variation 

Degrees of 
Freedom 

Location 

Michigan 

Minnesota 

Wisconsin 

Virginia 

Rhode Island 

Treatment 

4 

14.44 

82 84* 

17 44 

114 26* 

37 50 

Block 

4 

185 14 

54 64 

5 64 

70 76 

4.80 

Error 

16 

42 29 

26 67 

30 64 

26 34 

13 05 


be 5.22, dj. — 4, indicating little more than average sampling variation 
from a common or 2 ; hence, we shall pool the 5 error mean squares. 

Among all the states taking part in this experiment, the error mean 
squares varied significantly; five with homogeneous variance were picked 
out to illustrate what can be done with a common estimate. If error is 
heterogeneous, one special method which can be used will be indicated 
later. 

The 5 local experiments can be combined into a single analysis of 
variance as in table 11.32. For replications and error, the degrees of 
freedom as well as the sums of squares from the several stations are pooled 
in the manner of chapters 4 and 10. On the other hand, since each treat- 


TABLE 11.32 

Analysis of Variance of 5 Randomized Blocks Experiments. The Soybean Data 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Treatments 

4 

380 29 

95 07 

Location 

4 

11,852 61 

2,963 15 

Interaction 

16 

685 63 

42 85 

Replications 

20 

1,283 92 


Experimental error 

80 

2,223.68 

27.80 


For interaction: F « 42.85/27.80 — 1.54, F 05 » 1.77 

For treatment F » 95.07/42.85 ■ 2.22, F ot ~ 3.01 (see text) 
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ment is repeated at all places, the totals in the 2- way table, 11.31, yield 
sums of squares for location, for treatment, and for interaction, the calcu- 
lation being that of steps 3, 5, 6, and 7, section 11.8. Note that each cell 
contains the sum of 5 plot records. 

Since the pooled error mean square is an average of homogeneous local 
errors, it may be used to test the interaction; and also the treatment effect 
in the particular set of places chosen. Thus, in this experiment the value of F 
for treatments is 95.07/27.80 = 3.42, P = 0.02. But ordinarily this test 
is not of much interest. What we wish to know is whether a treatment is 
likely to be superior not only here but elsewhere in the region. Evidence 
on this depends upon the manner in which the places for the experiments 
were selected. If they can be considered a random sample of all places in 
the specified region, then the test for treatments is that given in the table — 
a component must be included in the error mean square (the treatment- 
place interaction) to allow for the fact that treatment differences may vary 
from place to place over the sampled region. The result of the test indicates 
that no general recommendation can be made with reasonable confidence. 

In this example, the stations are certainly not randomly chosen, nor 
were they in the entire cooperative seed treatment trial, though the whole 
group of stations might be considered representative of some region. The 
requirement of randomness is apparently never met in this kind of work, 
so that statements about probability must be considered inexact. 

Since the treatment and interaction mean squares are both larger than 
experimental error, it is appropriate to look for sources of the additional 
variation. One naturally asks if emergence in the treated plots as a group 
differs from that in the untreated. As before (section 11.8), the data are 
condensed into the form: 



Michigan 

Minnesota 

Wisconsin 

Virginia 

Rhode Island 

Untreated 

360 

302 

408 

244 

373 

Treated 

1,441 

1,367 

1,598 

1,073 

1,562 


The results in table 11.33 show that substantial portions of the sums of 
squares of both treatment and interaction are due to the more numerous 
emerging plants from the treated seeds, but the remaining mean squares 
are still larger than experimental error. 

From further examination of table 11.31 it appears that Semesan Jr. 
tends to be less effective than the other treatments except in Wisconsin 
and Rhode Island. Another 2X5 table with emerging plants from 
Semesan Jr. treated seeds in one line and totals from the other 3 treat- 
ments in the second, with another set of computations like those in section 
11.8, results in the sums of squares for “Semesan Jr. vs. others' 5 in table 
11.33. Deduction of these reduces the remaining mean squares to amounts 
easily attributed to experimental variation. 
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TABLE 11.33 

Analysis of the Soybean Seed Treatment Data to Isolate Sources of Variation * 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean ( 
Square 

Treatments: 

(4) 

(380.29) 


Untreated vs. treated 

1 

171.70 

171.70 

Semesan Jr. vs. others 

1 

180 88 

180.88 

Differences among remaining 3 

2 

27.71 

13.86 

Interaction: 

(16) 

(685.63) 


Untreated vs. treated 

4 

235.78 

58.94 

Semesan Jr. vs. others 

4 

189.84 

47.46 

Differences among remaining 3 

8 

260 01 

32.50 

Experimental error 

80 


27 80 


It is now plain that Arasan, Spergon, and Fermate have substantially 
the same effects in the several places, but that untreated plots and those 
planted with Semesan Jr. treated seeds introduce variation beyond that 
measured by experimental error. 

It remains to be said that the F-test for a comparison like untreated vs. 
treated, involving a single degree of freedom, 

F ~ 171.70/58.94 = 2.91, d.f. = 1 and 4, 

is little affected by the heterogeneity of experimental error usually en- 
countered in agricultural experiments. This kind of comparison is more 
easily tested by t: write down the 5 pairs of means (for untreated and 
treated seeds, respectively) in parallel columns, then proceed as in table 
2.2. The resulting / will be the same as y/F = \Z2.9l = 1.71, dj. = 4. 

11,14 — Three criteria of classification. With increasing knowledge 
about experimental material, the researcher may be able to extend the 
bases of his judgments about outcome. Not only sex and treatment may 
affect weight gains but also strain or generation, so that further classifica- 
tions are required in the analysis of variance. Another source of multiple 
classification is increasing complexity of treatment combinations. Both 
sources are involved in an experiment described by Winsor and Clarke 
(25), the object being to investigate the catch of various groups of plankton 
by 2 nets hauled horizontally, one 2 meters below the other. Ten hauls 
were made with the pair of nets at depths 29 and 31 meters respectively. 
It was observed that the standard deviations in the various groups were 
roughly proportional to the means, so that it seemed appropriate to 
examine the logarithms of the catch numbers (section 16.7). These are 
entered as the records in table 11.34. 

Each item in the table is classified by group, by haul and by position, 
but the total sum of squares is calculated, as usual, from the 100 entries, 
ignoring classification. 
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Correction: (242.36)7100 = 587.3837 

Total: (2.93) 2 + . . . + (1.45) 2 - 587.3837 = 31.3699 


This total is entered at the bottom of the table of analysis of variance. 

I The observations are now combined in three 2-way tables, 11.34. As 
examples: in A, haul 1, group I, 5.54 is the sum of the catches in the 
2 nets, 2.93 + 2.61; in C, group I, upper position, 28.69 = 2.93 + 3.05 
+ . . . + 3.03. In each of these tables the calculations are like steps 
3, 5, 6, and 7, section 11.8. Using B as illustration: 


3. Subclasses: 


(12.94) 2 +(11.60) 2 + . • . +(10.76) 2 
5 


-587.3837 = 4.1371 


5. Hauls: 


(24. 54) 2 + . . . + (22. 83) 2 
10 


- 587.3837 = 2.9969 


EXAMPLE 11.38- -In Minnesota during 1931, five varieties of barley were planted 
at 6 places, the yields (bushels per acre) being (12): 


Locality 

Block 

Variety 

Manchuria 

Glabion 

Trcbi 

Peatland 

Wisconsin 

Total 

University 

1 

29 

45 

41 

38 

44 

197 


2 

25 

39 

40 

30 

37 

171 


3 

27 

46 

37 

30 

36 

176 


Sum 

81 

130 

118 

98 

117 

544 

Waseca 

1 

48 

55 

64 

41 

61 

269 


2 

52 

53 

64 

52 

58 

279 


3 

47 

57 

64 

53 

57 

278 


Sum 

147 

165 

192 

146 

176 

826 

Morris 

1 

24 

28 

42 

25 

32 

151 


2 

25 

26 

42 

30 

26 

149 


3 i 

34 

33 

47 

35 

31 

180 


Sum 

83 

87 

131 

90 

89 

480 

Crookston 

1 

38 

40 

53 

40 

45 

216 


2 

38 

38 

40 

37 

59 

212 


3 

44 

37 

47 

47 

46 

221 


Sum 

120 

115 

140 

124 

150 

649 

Grand 

1 

34 

21 

38 

36 

32 

161 

Rapids 

2 

27 

30 

29 

39 

36 

161 

3 

38 

36 

22 

29 

36 

161 


Sum 

99 

87 

89 

104 

104 

483 

Duluth 

1 

29 

31 

37 

35 

30 

162 


2 

27 

33 

33 

28 

30 

151 


3 

32 

25 

32 

33 

35 

157 


Sum 

88 

89 

102 

96 

95 

470 

Total 


618 

673 

772 

658 

731 

3452 


This example is extracted from more extensive data examined by Yates and Cochran 
(30). Can you isolate some of the sources of variation? 
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6. Positions: 


(124.50) 2 + (117.86) 2 
50 


587.3837 = 0.4409 


7. HXP: 4.1371 - (2.9969 + 0.4409) - 0.6993 


The sum of squares for second order (or three factor) interaction , group 
X haul X position, is the difference between the total and the accumu- 
lated sums of squares of the first 6 effects in the table. The corresponding 


TABLE 11.34 

Logarithms of Catches of 5 Groups of Plankton in 2 Nets Hauled 10 Times 


Haul 

Posi- 

tion 

Group 

Total 

I 

II 

III 

IV 

V 

1 

Upper 

2 93 

2 60 

2.69 

3,04 

1 68 

12 94 


Lower 

2 61 

2 34 

2 41 

3 06 

1 18 

11.60 

2 

Upper 

3 05 

2 42 

2 93 

2 91 

1 94 

13 25 


Lower 

2 65 

2 13 

2 58 

2 65 

1 38 

11 39 

3 

Upper 

3 15 

2 34 

3 27 

2 90 

2 17 

13 83 


Lower 

3 02 

2 19 

3 22 

2 88 

2 01 

13 32 

4 

Upper 

2 54 

1 65 

2 59 

2 44 

1 97 

11 19 


Lower 

2 56 

1.82 

2 74 

2 69 

1 76 

11 57 

5 

Upper 

2 65 

1 90 

2 26 

2 85 

1 68 

11 34 


Lower 

2 30 

1 61 

2 15 

2 75 

1 15 

9 96 

6 

Upper 

2 85 

2 08 

2 98 

2 38 

1 81 

12 10 


Lower 

3 16 

2 12 

3 12 

2 69 

1 97 

13 06 

7 

Upper 

3 07 

2 04 

3 76 

2 96 

1.81 

13 64 


Lower 

3 08 

2 03 

3 64 

2 47 

1 77 

! 12 99 

8 

Upper 

2 44 

1 58 

3 03 

2 68 

1 74 

11 47 


Lower 

2 58 

1 65 

3 04 

2 61 

1 54 

11 42 

9 

Upper 

2 98 

1 86 

3 28 

2 60 

1 95 

12 67 


Lower 

2 92 

1 84 

2 91 

2 54 

1 58 

11.79 

10 

Upper j 

3 03 

2 06 

2 60 

2 64 

1 74 

12 07 


Lower 

2 79 

1 61 

2 52 

2 39 

1 45 

10 76 

Total 

56 36 

39 87 

57.72 

54 13 

34 28 

242 36 


Analysis of Variance 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Square 

Mean 

Square 

Main effects: 




Groups (G) 

4 

22 7583 

5.6896 

Hauls (H) 

9 

2.9969 

0 3330 

Positions (P) 

1 

0.4409 

0 4409 

First order interactions: 




G X H 

36 

3 9938 

0 1109 

GXP 

4 

0 1251 

0.0313 

HXP 

9 

0.6993 

0 0777 

Second order interaction: 




G X H X P 

36 

0 3556 

0.0099 

Total 

99 

31 3699 
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TABLE 11.34 — ( Continued ) 

Three 2-Way Tables, Summary of the Plankton Data 


A 

Group 


Haul 

I 


II 


III 

IV 


V 


Total 

1 

5 54 


4 94 


5 10 

6 10 


2 86 


24.54 

2 

5 70 


4 55 


5 51 

5 56 


3.32 


24.64 

3 

6 17 


4 53 


6 49 

5.78 


4.18 


27.15 

4 

5 10 


3.47 


5 33 

5.13 


3 73 


22.76 

5 

4.95 


3 51 


4 41 

5 60 


2 83 


21 30 

6 

6 01 


4.20 


6 10 

5 07 


3.78 


25 16 

7 

6.15 


4 07 


7 40 

5 43 


3.58 


26 63 

8 

5 02 


3 23 


6 07 

5 29 


3 28 


22.89 

9 

5 90 


3 70 


6 19 

5 14 


3.53 


24 46 

10 

5 82 


3 67 


5 12 

5 *03 


3 19 


22 83 

Total 

56 36 


39 87 


57 72 

54.13 


34 28 


242.36 

B 

Haul 


Position 

1 

2 

3 

4 

5 6 

7 

8 

9 

10 

Total 

Upper 

12 94 13 

25 

13 83 11 

19 

11 34 12.10 

13 64 11 

47 

12 67 12 

07 

124 50 

Lower 

11 60 11 

39 

13 32 11 

57 

9 96 13 06 

12 99 11 

42 

11 79 10 

76 

117 86 

Total | 

24 54 24 

64 

27 15 22 

76 

21 30 25 16 

26 63 22 

89 

24 46 22 

.83 

242 36 

C 

Group 


Position | 

I 


II 


III 

IV 


V 


Total 

U pper 

28 69 


20 53 


29 39 

27 40 


18 49 


124 50 

Lower 

27 67 


19 34 


28 33 

26 73 


15 79 


117 86 

Total 

56 36 


39 87 


57 72 

54 13 


34 28 


242.36 


degrees of freedom are the product of those for the 3 main effects, 
(4) (9) (1 ) = 36. 

The three factor interaction indicates: 

1 . The difference between the haul X group interactions in the 2 posi- 
tions; or 

2. The variation of the haul X position interactions in the 5 groups; or 

3. The variation of the group X position interactions in the 10 hauls. 
Components of variance in a 3-way tabic are as follows: 

Groups a 1 + 10(7 G p 2 + 2<j 0H 2 + 20cr o 2 

Hauls <r 2 + 5<W + 2 <x GH 2 + lOcr* 2 

Position <r 2 + 5 a H p 2 + lOo^ 2 + 50cr P 2 

GXH cr 2 + 2a GH 2 

GXP a 2 + 1(W 

HXP a 2 + 5(r*e 2 

GXHXP <j 2 
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The coefficients of the sigmas are the numbers of items making up the 
corresponding cell and border totals in table 11.34. In B , for instance, 
each cell total is the sum of 5 items; hence, the coefficient of c HP 2 is 5. 
Similarly, the coefficient of a H 2 is 10 because each haul total comprises 10 
observations. 

By equating the sample estimates in the analysis of variance to corre- 
sponding component sums it is easy to compute s 2 = 0.0099, s HP 2 = 0.01 36, 
s GP 2 — 0.0021, Sgu 2 = 0.0505, s P 2 = 0.0068, 2 = 0.0154, and s a 2 = 
0.2779. 

The component analysis is worked out on the following assumptions: 
(i) The variation of any X about m has 7 sources, one common to all 
members of the same group, one to all members of the same haul, etc. 
For a description of this common variation, review the sampling of table 
10.9. (ii) The various sources of variation are independent and their 
effects additive. 

Second order interaction often does not seem to be due to any real 
biological relationship but only to random deviations such as those meas- 
ured by discrepance in the randomized blocks experiment (section 11.3). 
This second order discrepance is, therefore, commonly taken as an esti- 
mate of experimental error; that is, of a 2 in a normally distributed popu- 
lation with m = 0. If one wishes to estimate and test second order inter- 
action, he should provide two or more experimental units instead of one in 
each cell of the 3-way table, as was done in the 2-way table 11.14. 

On the assumption that s l measures only random variation, it becomes 
interesting to use it for testing the significance of other effects For testing 
the hypothesis that any first order interaction is zero, the F ratio is clearly 
indicated by the components of variance. Thus, for testing <j GP 2 — 0, 


J - 2 + 1(W 


0.0313 

0.0099 


= 3.16, P = 0.03 


While all the two-factor interactions are significant, examination of 
table 11.34 fails to reveal any very striking contrasts. The most notable 
seems to be the large difference between the catches of group V in the two 
nets. 

The method of testing a null hypothesis about a main effect depends 
on the assumptions made. In testing o> 2 = 0, for example, one might 
assume that the group and haul means are unchanging parameters; that 
is, that the test is to apply to this particular experiment only. Then, 

F = 0.4409/0.0099 = 44.54, d.f. = 1 and 36 

But under this assumption the test is not very interesting. It could well 
be confined to the specific groups enumerated, but not to the same hauls. 
A more reasonable assumption is that the hauls constitute a random sam- 
ple from some population for which o> 2 = 0 may be tested, the groups 
being unvarying as before. In that case, the component for the position 
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differences with haul, 5 s ffP 2 i must be added to the estimate of error, and 
F = 0.4409/0.0777 = 5.67, d.f. = 1 and 9, P = 0.05 

Concerning the position and position X group effects, Winsor and 
Clarke said, “The interpretation to be put on these figures is not entirely 
clear to us. Their statistical significance is well enough established. . . . 
We hesitate, however, to assert that they give a true picture of the distri- 
bution in depth of the plankton organisms.” Other data led them to 
suspect that the results in this experiment may have been due to a larger 
net hauled between the two here reported. 

Among the components of variance, two stand out; s OH 2 = 0.0505 and 
s 0 2 = 0.2779. The latter is due to the differences in abundance of the 
groups of organisms. The former seems to reflect changing differences 
among the groups during the 4 hours occupied by the experiment. 

EXAMPLE 11.39 — Here is a 3-way table for practice: two observations in each 
cell of the table. 


Species 

Sex 


1 

Treatments 

2 3 

— 

A 

Male 

8, 

10 

8, 12 

7, 11 

7, 


Female 

3, 

7 

6, 8 

3, 3 

i, 

B 

Male 

5, 

3, 

9 

7, 7 

9, 11 

n, 


Female 

3 

4, 8 

5, 9 

6, 

C 

Male 

7, 

9 

6, 14 

12, 16 

14, 


Female 

4, 

4 

4, 6 

10, 12 

12, 


Analysis of Variance 


Source of Variation 

Degrees of Freedom 


Mean Square 

T reatments 

3 


30 

Species 

2 


52 

Sex 

1 


192 

Treatment-species 

6 


30 

Treatment-sex 

3 


2 

Species-sex 

2 


4 

T reatment-species-sex 

6 


2 

Individuals 

24 


5 

Total 

47 


11.15 — The split plot experiment. It is often desirable to get rather 
precise information on one set of treatments as well as on the interaction 
of this set with a second, but to forego such precision on the second set. 
For example, 3 sources of a vitamin might be compared by trying them on 
3 males of the same litter, replicating the experiment on 20 litters. This 
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would be a randomized blocks design with 38 d.f. for error. Super- 
imposed on this could be some experiment with the litters as units: four 
types of housing could be tried, one litter to each type, thus allowing 
5 replications. The main treatments (housings) would not be compared so 
accurately as the subtreatments (sources of vitamin) for two reasons; 
less replication is provided, and litter differences are included in the error 
for evaluating the housing effects. Nevertheless, some information about 
housing may be got at little extra expense, and any interaction between 
housing and vitamin will be accurately evaluated. 


TABLE 11.35 

Yields of 3 Varieties of Alfalfa (Tons per Acre) in 1944 Following 4 Dates 
of Final Cutting in 1943 



p- ■* 



Block 



Variety 

Date 

1 

2 

3 

4 

5 

6 

Ladak 

A 

2 17 

1 88 

1 62 

2 34 

1 58 

1 66 


B 

1 58 

1 26 

1 22 

1 59 

1 25 

0 94 


C 

2 29 

1 60 

1 67 

1 91 

1 39 

1 12 


D 

2 23 

2 01 

1 82 

2 10 

1 66 

1 10 



8 27 

6 75 

6 33 

7 94 

5 88 

4 82 

Cossack 

A 

2 33 

2 01 

1 70 

1 78 

1 42 

1 35 


B 

1 38 

1 30 

1 85 

1 09 

1 13 

1 06 


C 

1 86 

1 70 

1 81 

1 54 

1 67 

0 88 


D 

2 27 

1 81 

2 01 

1 40 

1 31 

1 06 



7 84 

6 82 

7 37 

5 81 

5 53 

4 35 

Ranger 

A 

1 75 

1 95 

2 13 

1 78 

1 31 

1 30 

B 

1 52 

1 47 

1 80 

1 37 

1 01 

1 31 


C 

1 55 

1 61 

1 82 

1 56 

1 23 

1 13 


D 

1 56 

1 72 

1 99 

1 55 

1 51 

1 33 



6 38 

6 75 

7 74 

6 26 

5 06 

5 07 

Total 

22 49 

20 32 

21 44 

20 01 

16 47 

14 24 











Date of Cutting 


Variety 

A 

B 

C 

D 

Total 

Ladak 

11 25 

7 84 

9 98 

10 92 

39 99 

Cossack 

10 59 

7 81 

9 46 

9 86 

37 72 

Ranger 

10 22 

8.48 

8 90 

9 66 

37 26 

Total 

32 06 

24 13 

28 34 

30 44 

114 97 

Mean (tons 






per acre) 

1.78 

1.34 

1 57 

1 69 
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The data in table 11.35 are from a split plot field trial on alfalfa. 
Three varieties were planted on the main plots, and these were replicated 
6 times (24). As shown in figure 11.3, each of the 18 main plots was split 
into 4 subplots on which there were 4 dates of final cutting: A, no third 
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Fig. 11.3 — First 2 blocks of split plot experiment on alfalfa, illustiating random 
arrangement of main and sub-plots. 


cutting, the second harvest on all plots being on July 27; B , third cutting 
on September 1, 1943; C, September 20; and D, October 7. Not only 
must the varieties be randomized on the main plots, but the dates must 
be assigned at random to the subplots of each main plot. 

The analysis of variance in table 11.36 reflects the limitations imposed 
by the split plot design. The main plot analysis is that of a randomized 
blocks experiment with 3 varieties replicated in 6 blocks. In the subplot 
analysis, the 4 dates of cutting are replicated on each of the 18 main 


TABLE 11.36 


Analysis of Variance of Spi it Plot Experiment on Alfalfa 


Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Main plots: 

Varieties 

2 

0 1781 

0 0890 

Blocks 

5 

4 1499 

0 8300 

Main plot error 

10 

1 3622 

0 1362 

Subplots: 



0 6542 

Dates of cutting 

3 

1 9625 

Date X variety 

6 

0 2105 

0 0351 

Subplot error 

45 

1 2586 

0 0280 


For varieties: F — 0 0890/0 1362 *=* 0 65 

For dates: F — 0 6542/0 0280 = 23 36 

Date X variety: F = 0 0351/0 0280 = 1 25 
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plots. The error for subplots is usually smaller than that for main plots 
because the smaller plots lie closer together with less hazard of soil differ- 
ences. 

A comparison of this analysis with that of the 3-way table of the plank- 
ton sampling shows: (i) that the interaction between variety and block is 
the error for testing main treatments; and (ii) that there is no real second 
order interaction, but instead a pooling of the date-block discrepances 
(with (4 — 1)(6 — 1) = 15 d.f.) within the 3 varieties, leading to the 
second error (with 3 X 15 = 45 d.f.) for testing subtreatments and their 
interaction with main treatments. 

The computations are as follows: 

1. Correction: C = (114.9) 2 /72 = 183.5847 

2. Total: (2.17) 2 + . . . + (1.33) 2 - C = 9.1218 


3. Main plot: 


(8.27) 2 + . . . + (5.07) 2 


- C = 5.6902 


4. 

5. 

6 . 
7. 


Ty . (39.99) 2 +... + (37.26) 2 

Variety: — - C - 0.1781 

24 

(22.49) 2 + . . . + (14.24)2 

Block: — — L - C = 4.1499 

12 

Main plot error: 5.6902 - (0.1781 + 4.1499) = 1.3622 

Subclasses in variety-date table: 

(11. 25) 2 + . . . + (9.66) 2 
v LfL i L - c = 2.3511 


o T . . (32.06)2 + . . . + (30.44)2 

8. Date: C 

18 


1.9625 


9. Variety X date: 2.3511 - (0.1781 + 1.9625) = 0.2105 

10. Subplot error: 9.1218 - (5 6902 + 1.9625 + 0.2105) = 1.2586 


The significant differences among dates of cutting were not unexpected, 
nor were the smaller yields following B and C. The last harvest should be 
either early enough to allow renewed growth and restoration of the conse- 
quent depletion of root reserves, or so late that no growth and depletion 
will ensue. The surprising features of the experiment were two: the yield 
following C being greater than B , since late September is usually considered 
a poor time to cut alfalfa in Iowa; and the absence of interaction between 
date and variety — Ladak is slow to renew growth after cutting and might 
have reacted differently from the other varieties. These are probably 
seasonal peculiarities. 
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Such an experiment as this is, of course, not evaluated by a single 
season’s yields. The statistical methods for perennial crops are described 
in the next section. 

11.16 — Perennial plants in randomized blocks. The evaluation of 
yields from perennial plants is characterized by the continuity of the plot 
throughout the lifetime of the experiment. Formally, the analysis of 
variance duplicates that of the plankton data, each record being classified 
by plot (treatment or variety), by block and by season. The distinction is 
that the annual readings for a plot are likely correlated from year to year, 
introducing modifications in the testing of significance. 

For illustration, uniformity data on orange crops are given in table 
11.37 (15). The investigators reported the yield of each tree each season. 
On the field plan I delineated blocks of five 16-tree plots, then assigned 
at random the dummy treatments, A — E. 

Inspection of the table will reveal several instances of the consistent 
superiority of one plot over another; for example, the plot designated as 
variety A in block 2 gave a higher yield during each of the 4 years than 
that assigned to variety B in the same block. Such permanent differences 
in fertility between plots contribute to the errors of a variety total and 
require a corresponding component in the mean square appropriate for 
testing such totals. The suitable error is the variety-block discrepance 
obtained from analysis of the 2-way table shown, each entry being the 
four-year total. It is as though this were a 5 X 3 randomized blocks 
experiment with yields accumulated for its duration. The F- test properly 
indicates no population differences among the dummy varieties. 

The full analysis of the orange data is given in table 11.38 because it 
throws some light on the test just made and provides others that may be of 
interest. As before, each two-factor interaction may be tested against 
season-variety-block, the assumption being that the latter contains only 
random variation. The fact that the variety-block interaction is highly 
significant (F = 222/54 = 4.11, Fci = 3.36) emphasizes what was said 
above — the fertility differences are scarcely accidents of sampling, and 
could not safely be ignored in assessing variety yields. 

Nonsignificance of the season-variety interaction in an experiment with 
real varieties would not only be informative but would widen the basis 
of the inference made about variety differences. To see this, recall the 
components of variance in the F-ratio for testing variety: 

C 2 + 4 Gyb" ~f~ 3 Gys + \2cTv 2 
cr 2 + 4(7 vb" 

The test is ordinarily made on the assumption that the season effects are 
population parameters so that inferences apply only to this particular set 
of seasons (or seasons exactly like them). But if <r vs can be taken as zero, 
as in this experiment, and if the seasons 1924-27 can be assumed repre- 
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TABLE 11.37 

Yield (Pounds per Tree) of Washington Navel Oranges From 16-Tree-Plots 
in Riverside, California 


Season 

Block 

Variety 

A 

B 

C 

D 

E 

1924 

1 

130 

152 

118 

134 

140 


2 

171 

163 

164 

165 

146 


3 

159 

173 

170 

164 

164 

1925 ! 

1 

132 

125 

119 

119 

126 


2 

146 

138 

149 

146 

126 


3 

150 

143 

140 

142 

140 

1926 

1 

93 

109 

85 

95 

80 


2 

122 

108 

122 

123 

104 


3 

112 

120 

124 

104 

121 

1927 

1 

172 

158 

170 

164 

160 


2 

186 

157 

166 

V6 

148 


3 

158 

174 

162 

161 

175 


Totals for 4 Seasons 


Block 

Variety 

Mean 

A 

B 

C 

D 

E 

1 

527 

544 

492 

512 

506 

129 0 

2 

625 

566 

601 

610 

524 

146 3 

3 

579 

610 

596 

571 

600 

147.8 

Mean 

144 2 

143 3 

140 7 

141 1 

135 8 

141 05 


Analysis of Variance for Testing Varieties 


Source of Variation 

Degrees of Freedom 

Mean Square 

Variety 

4 

128 

Block 

2 


Variety-block 

8 

222 


F « 128/222 * 0.58 


sentative of seasons in general, then the test of <j v = 0 may be applied 
to all such seasons. 

The contents of sections 11.13-11.15 should make it plain that testing 
and interpreting complex experiments cannot be reduced to any simple 
set of rules. Illustrations have been drawn from the more ordinary types 
of experiments. If you find it necessary to use less common designs, it may 
be desirable to consult a mathematical statistician with experience in your 
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TABLE 11.38 

Analysis of Variance of Orange Yields 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Variety 

4 

513 

128 

Block 

2 

4,342 

2,171 

Variety-block 

1 8 

1,780 

222 

Season 

3 

28,407 

9,469 

Season-variety 

1 12 

425 

35 

Season-block 

6 

1,456 

243 

Season-variety-block 

24 

1,306 

54 

Total 

59 

38,229 


line of research. N.B : do this before your experiment starts, not after it is 


finished. 

11.17 — Correlation between mean and variance. A fundamental 
condition for the validity of the F-test of significance is that the two mean 
squares be independent. If one is conditioned by the other in any manner, 
the test is invalid. The popularity of analysis of variance has resulted in 
its application to data in which this condition is not met; specifically to 
percentages or frequencies of occurrence of some such event as infestation 
by a parasite. These tend to be distributed in the manner to be described 
in chapter 16, especially if the event is infrequent. The variance is then 
closely related to the mean. Hence, if samples are drawn from several 
such distributions, the mean squares in F will not be independent. This 
decreases the reliability of the test of significance. 

You may be surprised by the suggestion that enumeration data can be 
treated with methods appropiiate to measurement data. Heretofore the 
two kinds have been kept quite distinct. (For exceptions, recall examples 
8.1, 8.4, and 8.8 ) There arc many instances, however, in which it is 
either convenient or necessary to pass from one set of methods to the other, 
as when chi-square was used to test the homogeneity of variance in section 
10.13. As experimentation grows more complicated the flexibility of 
analysis of variance is of increasing advantage, making desirable the 
transformations that allow its application to enumeration data. Consider 
for example, a randomized block or latin square experiment containing 
several plots treated with some disinfectant. Chi-square may be used 
to test the homogeneity of the percentages of infestation; and the means 
for two disinfectants may be similarly tested if each is based on plots with 
uniform infestation. But such results are rare. Even though the plots 
are treated alike they usually differ significantly in number or percentage 
of plants attacked. It would not be correct, then, to test the means by 
use of chi-square. The present trend is toward analyzing the variance of 
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such enumeration data after a transformation that makes variance inde- 
pendent of means. 

Much experimental data, however, whether expressed in percentages 
or frequencies, may be safely subjected to variance analysis. Percentages 
which express frequency of occurrence per hundred units when calcu- 
lated from counts of 100 or more in the numerator and when ranging be- 
tween 20% and 80%, may be expected to yield valid tests in analysis of 
variance. Similarly, counts of infested plants in experimental plots, for 
example, running into three figures, usually offer no difficulties. 

On the other hand, if percentages result from less than 100 affected 
individuals, or if the event enumerated is infrequent, some transformation 
of the variable may be necessary before analysis of variance is carried 
through. Methods will be discussed in section 16.7. 
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Chapter 12 


TWO VARIATES IN TWO OR MORE GROUPS. 
COVARIANCE 


12.1 — Introduction. Covariance is a term applied to a statistical 
method used to reduce the data from experiments involving two or more 
variates measured in several groups. In this chapter the method will be 
applied to two variates only. In that respect we shall be occupied in 
extending the methods of regression and correlation as developed in 
chapters 6 and 7 to include more than one group of data. In the use of 
correlated data we shall be broadening the methods of analysis of variance 
treated in the last two chapters. The use of the word covariance to desig- 
nate a statistical method as well as a mean product (section 6.5) need 
cause no confusion. 

12.2 — An experiment with a single criterion of classification. The 

rats whose food intake and gain in weight are recorded in table 12.1 were 

TABLE 12 1 

Food Intake, X (10-Calorie Units), and Gain in Weight, Y (Grams), of 60 Rats 
Receiving Six Rations in Lots of 10 









Lot 








1 


2 


3 


4 


5 


6 


Rat 

X 

Y 

X 

Y 

.V 

Y 

X 

Y 

X 

) 

X 

Y 


1 

108 

73 

99 

98 

194 

94 

165 

90 

124 

107 

140 

49 


2 

136 

102 

117 

74 

198 

79 

164 

76 

95 

95 

177 

82 


3 

138 

118 

90 

56 

196 

96 

161 

90 

116 

97 

189 

73 


4 

159 

104 

141 

111 

198 

98 

159 

64 

112 

80 

142 

86 


5 

146 

81 

106 

95 

210 

102 

175 

86 

123 

98 

216 

81 


6 

141 

107 

112 

88 

196 

102 

135 

51 

no 

74 

200 

97 


7 

175 

100 

110 

82 

230 

108 

132 

72 

137 

74 

255 

106 


8 

149 

87 

117 

77 

222 

91 

190 

90 

105 

67 

173 

70 


9 

174 

117 

111 

86j 

220 

120 

145 

95 

155 

89 

153 

61 


10 

176 

111 

122 

92 

228 

i 

105 

142 

78 

126 

58 

160 

82 

Sums 

1,502 1,000 

1,125 

859 2,092 

995 

1,568 

792 

jl , 183 

839 

1,805 

787 

SX 1 


229,760 

128,245 

439,544 

248,886 

141,525 

337,433 


SY* 

102 

,062 

75 

,819 

100,075 

64,462 

72 

613 

64,401 

SXY 

151,846 

97,776 

208,892 

125, 

370 

99,195 

145,872 


Experiment totals: ,VA' = 9,275 - 5,272 SXY - 828,951 

SX 2 = 1,525,393 SY 2 * 479,432 


[ 318 ] 
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classified according to the single criterion, ration fed. The object is to 
learn if the six foods were associated with differences in gain. It was recog- 
nized that such differences might be caused either by unequal avail- 
ability of the nutritive elements in the food or by variations in palatability. 
The investigation of the relations between two variates such as food in- 
take and gain, occurring in two or more groups, calls for the methods of 
covariance. 

We shall first ask these questions: did the six rations induce significant 
differences among the lot gains, and how were these gains conditioned by 
food intake? The first question is easily answered by the straightforward 
analysis of variance in the manner of chapter 10. Table 12.2 indicates 


TABLE 12.2 

Analysis ok Variance of Rat Gains in Weight 


Source of Variation 

Degrees of Freedom 

Mean Square 

Rats within lots 

54 

215 

Rations 

5 

923** 

F = 923/215 - 4.3 


highly significant differences among the lot gains. The answer to the 
next question is the subject matter of this chapter on covariance. 

12.3 — The test of significance of differences among adjusted group 
means. Since this test is of general utility and fairly easy to make, the 
mechanism will be presented now leaving explanations till later. The 
first step consists in the calculation of the sum of squares ot errors of esti- 
mate for the experiment as a whole, using the method of chapter 6. The 
experiment totals at the bottom of table 12.1 lead to the following: 

Correction terms: For X, (9,275) 2 /60 = 1,433,760.42 
For Y, (5,272) 2 /60 = 463,233.07 
For AT, (9,275) (5,272) / 60 = 814,963.33 

Sums of squares: Sx 2 = 1,525,393 — 1,433,760.42 = 91,632.58 
Sf = 479,432 - 463,233.07 = 16,198.93 

Sum of products: Sxy = 828,951 — 814,963.33 = 13,987.67 

The sums of squares and products, entered in the first line of table 12.3, 
are used to compute the total sum of squares of errors of estimate (section 
6 . 8 ): 

Sf - ( SxyY/Sx 2 = 16,198.93 - (13, 987.67) 2 /91, 632.58 =|14, 063.72 
The degrees of freedom are two less than the entire number of rats. 
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TABLE 12.3 

Analysis of Covariance and Test of Significance of Adjusted Lot Means 



1 

Sums of Squares and Products 

Errors of Estimate 

Source of 
Variation 

Degrees 

of 

Freedom 

Sx* Sxy Sf 

Degrees 

Sum of of Mean 

Squares 1 Freedom Square 

Total 

59 

91,632 58 13,987.67 16,198 93 

14,063.72 58 

Rations 

5 

67,662.68 5,520 97 4,612.93 

Within lots 
(error) 

54 

23,969 90 8,466 70 11,586 00 

i 

8,595.37 53 162.2 


For test of significance of adjusted means 5 , 468 .35 5 1, 093 .7** 


i sy - 


(%) 5 

Sx* 


F - 1,093.7/162.2 * 6.74 


We next compute sums of squares and products characterizing the 
regression of the lot mean gain on lot mean food intake. The calculations 
are in the forms typical of analysis of variance, employing the lot sums: 


Food intake: 


(1,502) 2 + . . . + (1,805) 2 
10 


1,433,760.42 = 67,662.68 


Gains: 


(1,000) 2 + . . . + (787) 2 
10 


- 463,233.07 = 4,612.93 


„ , (1,502)(1, 000)+... + (1,805)(787) 

Products: 

10 


814,963.33 = 5,520.97 


These quantities entered in line 2 might be used to calculate the sum of 
squares of errors of estimate of lot mean gains measured from the regression 
of means. That will be done later, but it is unnecessary for the present 
test. Instead, the sums of squares and products are merely subtracted from 
the corresponding quantities in line 1 to get those of line 3. These last 
can be calculated directly from the data of the individual lots, in the man- 
ner of table 12.6, thus completely verifying the computations. At present 
we use them in a regression within lots similar to the total first calculated. 
As will be shown later, this regression is an average of the individual lot 
regressions. From it have been eliminated any effects of the differences 
among the lot means themselves. The associated sum of squares of errors 
of estimate, 8,595.37, is calculated as usual. Since this sum is made up of 
uncontrolled variations in gain after due allowance is made for food intake , it is 
the basis for an estimate of experimental error. The accompanying de- 
grees of freedom are 54 — 1 = 53, the one deducted belonging to the 
last regression computed, that within lots. 
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The difference in the third-to-last column of the table, divided by 
the corresponding degrees of freedom, 5,468.35/5 = 1,093.7, is the mean 
square appropriate for testing the significance of the differences among 
the adjusted lot mean gains. Since F is highly significant the conclusion 
is that the quantity of food intake does not explain the differences among 
the lot gains: after these gains are adjusted to a common food intake 
basis they still differ significantly. Evidently the nutritive materials in 
these foods are not equally adapted to assimilation by the rats. Without 
the covariance analysis, one would not know whether the differences in 
gain were attributable to quantity ol nutritive materials eaten or to their 
quality. 

One might or might not get similar results by first reducing each gain 
to a per-calorie basis, then analyzing the variance of gain per calorie. Unless 
previous investigations have established the validity of the analysis of the 
ratios, it is safer to use the covariance method. Besides, it furnishes a 
wealth of information which would be lost if only gain per calorie were 
considered. This will be increasingly evident as the subject unfolds. 

EXAMPLE 12.1 — This is a set of numbers designed for easy computation: 


Group 



! 1 

2 

3 

4 

Item 

; x 

Y 

X 

Y 

X 

Y 

X 

Y 

1 

1 29 

22 

15 

30 

16 

12 

5 

23 

2 

20 

22 

9 

32 

31 

8 

25 

25 

3 

14 

20 

1 

26 

26 

13 

16 

28 

4 

21 

24 

6 

25 

35 

25 

10 

26 

5 

6 

12 

19 

37 

12 

7 

24 

23 


Verify the following: 



Errors of Estimate 


Degrees oi 

Mean 

Source of Variation 

Freedom 

Square 

Error 

15 

16.7 

For testing significance of adjusted means 

3 

307 7 


12.4 — The adjusted lot means and a question in design. It is 

instructive to calculate the adjusted lot mean gains, then compare them 
with the unadjusted. You may recall the formula for the adjusted values 
(section 6.7), Y — bx. Compute the error regression coefficient from table 
12.3, 

, Sxy 8,466.70 . 

b = — - = 0.3532 grams per unit 

£v 2 23,969.90 
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From table 12.1, x = 9,275/60 = 154.6 units, while the lot sums readily 
yield the lot means. The resulting information is put together in table 
12.4. In the main the adjusted and actual mean gains are not greatly 

TABLE 12.4 

Calculation of Adjusted Mean Gains of Rats 
£ «■ 154.6 grams b ** 0.3532 grams per unit food 


Lot 1 

Mean Food 
Intake 

X 

Deviation 

From 

Experiment 

Mean 

X 

Product 

bx 

Mean Gain 

Y 

Adjusted 
Mean Gain 

Y - bx 

1 

150 2 

- 4.4 

- 1 .6 

100 0(1)* 

101 6(1)* 

2 

112 5 

-42 0 

-14 8 

85 9(3) 

100 7(2) 

3 

209 2 

54 6 

19 3 

99.5(2) 

80.2(4) 

4 

156 8 

2 2 

0 8 

79 2(5) 

78 4(5) 

5 

118 3 

-36 3 

-12 8 

83 9(4) 

96 7(3) 

6 

180 5 

25 9 

9 1 

78 7(6) 

69.6(6) 

Sums 

927 5 

0 0 

0 0 

527 2 

527 2 


* Rank. 


discrepant. The high- and low-ranking lots are the same in both lists. The 
major shift is in the lots ranking third. With a knowledge of the food 
compositions, the nutritionist might draw some conclusions from the 
changes in rank; the statistician can only point them out. 

The graphical representation of figure 12.1 may be of assistance. 



Fio. 12.1 — Average regression of gain on food intake in six lots of rats. The dots 

represent the lot means. 
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The regression equation within lots is 

f = 87.87 + 0.3532 (X - 154.6) 

= 0.3532A" + 33.27 

% 

This is plotted in the figure along with the six points representing lot 
means. What we have learned is that these points deviate significantly 
from the line. The amounts of food consumed do not account for the lot 
gains. 

The investigator will wish to know if it has been worthwhile to intro- 
duce the independent variate into the experiment. Three items of infor- 
mation will assist him in making judgment. First, the list of actual and 
adjusted means. Sometimes the rank order of adjusted means is quite 
different from that of the unadjusted, and the shifts may be interpreted. 
Second, a comparison of the sum of squares of errors of estimate (table 
12.3) used for testing significance, 5,468.35, with Sy 2 = 4,612.93. In this 
example the former is greater than the latter, but the opposite may occur. 
Any notable contrast in the sizes of these sums of squares is informative. 
Third, the change in the precision of the experiment owing to adjustment 
of the error sum of squares. The changes taking place in this experiment 
are clearly indicated in table 12.5. The sum of squares, Sy 2 = 11,586, 

TABLE 12 5 

Analysis of Error Variance in Rat Data 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Within lots, unadjusted gams, Sy 1 

54 

11,586 

215 

Reduction clue to regression, {Sx)') 2 /lSa 2 

1 

2,991 

2,991** 

Error for adjusted gains j 

53 

8,595 

162 


F « 2,991/162 - 18 


with 54 degrees of freedom is analyzed into two parts, one with a single 
degree of freedom measuring the variation attributable to linear regres- 
sion, the other with 53 d.f. being assigned to error. Not only is the mean 
square for error reduced from 215 to 162, but the reduction in sum of 
squares due to regression is highly significant. This reveals a valuable 
increase in the precision of the experiment. A consideration of these three 
features of the covariance method will enable the investigator to decide 
whether to retain the independent variable in similar experiments. 


EXAMPLE 12.2 — In the following table are initial weights X (pounds) and average 
daily gains Y (pounds per day) of four lots of swine fed different rations. For ease in 
computation these data are somewhat oversimplified. (You will remember that there 
should be ranges of from 25 to 40 in the significant digits of each variate.) Make the 
necessary calculations to test the significance of the differences among the lot mean 
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gains ( F = 15.5) as well as among the adjusted mean gains (F * 19). For your 
guidance, some results are appended to the data. 


Pig Number 

Lot 1 

Lot 2 

Lot 3 

Lot 4 

Weight 

X 

Gain 

Y 

Weight 

X 

Gain 

Y 

Weight 

X 

Gain 

Y 

Weight 

X 

Gain 

Y 

1 

62 

1 2 

79 

2 0 

71 

1.2 

61 

1 4 

2 

73 

1 4 

65 

1 8 

60 

1.3 

59 

1 8 

3 

58 

1.3 

57 

1 6 

54 

1 4 

59 

1 6 

4 

43 

1 3 

51 

1 8 

50 

1 4 

53 

1 5 

5 

50 

1 4 

57 

1 9 

60 

1 2 

56 

1.7 

6 

44 

1 2 

66 

1 5 

61 

1 2 

50 

1 5 

7 

48 

1 0 

44 

1 6 

44 

1 2 

45 

1 4 

8 

51 

1 6 

41 

1 5 

53 

1 0 

39 

1 4 

9 

40 

1.2 

44 

1 8 

41 

1.1 

38 

1.3 

10 

38 

1.1 

36 

1 3 ! 

38 

1.1 

45 

1 3 


507 

12 7 | 

. 

540 

16 8 

532 

12 1 

505 

14.9 


Source of 
Variation 

Degrees 

ol 

Freedom 

Sums of Squares and Products 

Errors of Estimate 


Sxy 


Degrees 

Sum of of Mean 

Squares Freedom Square 

Total 

39 

4,355 60 

37 55 

2 464 

2 1403 38 

Lots 

3 

93 40 

3 61 

1 389 


Error 

36 

4,262 20 

33 94 

1 075 

0 8047 35 0 023 


Difference for testing adjusted means 1 . 3356 

3 

0 445** 

F = 0.445/0.023 = 19 




EXAMPLE 12.3— The data in example 12 2 arc taken from an experiment (3) in 
which the attempt was made to forecast the gains of the pigs as a basis for assignment 
to the lots. The four pigs numbered 1 were expected to gain at the same rate if they 
had been fed the same ration. That is, the pig numbers roughly lepresented levels of 
gaining ability. One is warranted, then, in introducing a second criterion of classifica- 
tion, making the experiment essentially a randomized block design like that of section 
1 1.2. The sum of the four gains in each of the 10 lines constitutes the block sum. It is 
easy to compute this table. 


Source of Variation 

Degrees of Freedom 

Mean Square 

Blocks or outcome groups 

9 

0 0558* 

Lots or rations 

3 

0 4630** 

Error 

27 

0 0212 


F - 0.4630/0 0212 » 22 


You can now make some comparisons between the two designs for experiments. If 
you know enough about your experimental material to classify it into outcome groups, 
you can segregate the sum of squares for such groups by the methods of chapter 11, 
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thus avoiding covariance. You will notice that the mean square for outcome groups is 
significant; also that Ffor lots is larger than in the table of example 12.2. On the other 
hand, the error mean square now has only 27 d.f ., and you have sacrificed all the extra 
information available from the covariance analysis. Both designs are excellent. In 
situations like the one under discussion, the investigator may choose between them. 
The object of this example is to contrast the possibilities which they offer. 

12.5 — Covariance in the groups. In experiments expressly designed 
for covariance, the foregoing test of significance is usually the end of the 
statistical investigation. For two reasons, however, it is worthwhile to 
examine somewhat minutely the covariance in the several groups, to- 
gether with the relations among these groups. First, it is desirable to gain 
a knowledge of the structure of the test we have just learned to make. 
In the second place, the statistician is often called upon to analyze sets of 
data in which covariance had not been contemplated. The relations 
among the groups of data may be more pertinent than any single test of 
significance. 

Let us return to the rat data in table 12.1 and calculate the regression 
in each lot. The results arc entered in table 12.6. In each line the devia- 

TABLE 12 6 

Regression and Correlation Data in Six Lots of Rats 


Lot 


Degrees 

of 

Freedom 


1 9 

2 9 

3 9 

4 9 

5 9 

6 9 


Sums 54 


Sums of Squares and Products 


£* 2 


Sxy 



sy 


4,159 

60 

1,646 

00 

2 

062 

00 

1,682 

50 

1,138 

50 

2 

030 

90 

1,897 

60 

738 

00 

1 

072 

50 

3,023 

60 

1,184 

40 

1 

735 

60 

1 , 576 

10 

— 58 

70 

2, 

220 

90 

11,630 

50 

3,818 

50 

2 

464 

1° 

23,969 

90 

8,466 

70 

11. 

586 

ooj 




Errors of 



Estimate 

Correia- 

Regres- 



— 

tion Co- 

sion Co- 

Sum of 


efficient 1 

efficient 2 

Squares 3 

il 

0 5620 

0 3957 

1,410 66 

8 

0 6159 

0 6767 

1 ,260 51 

8 

0 5173 

0 3889 

785 48 

8 

0 5170 

0 3917 

1,271 65 

8 

-0 0314 

-0 0372 

[2,218 71 

8 

0 7133 

0 3283 

j 

1 .210 42 

8 

0 5081 

0.35321 

1 

8,157 43 

48 


1 Sxy/ \/(Sx*)W) 2 Sxy/Sx* * 6> 2 - (, Sxy)*/Sx 2 


tions whose sums of squares and products are set down arc measured from 
the lot mean, not from the experiment mean as in table 12.3. That implies 
a set of corrections for each lot, the set for lot 1 being: 

For X: (1,502) 2 /10 = 225,600.40 
For Y: (1,000)710 - 100,000.00 
For AT: (1,502)(1,000)/10 - 150,200.00 
The sums of squares and products for lot 1 follow from applying these 
corrections to the totals in the last three lines of table 12 1: 

For Jot 1: A* 2 = 229,760 - 225,600.40 = 4,159.60 
Sy 2 = 102,062 - 100,000.00 - 2,062.00 
Sxy - 151,846 - 150,200.00 = 1,646.00 
These and similar results are entered in the table. 



326 


Statistical Methods 


Section 12.5 


Since we arc going into the data rather thoroughly, it is of interest to 
compute correlation and regression coefficients as well as the sums of 
squares of errors of estimate. The values of each set of coefficients differ 
considerably among themselves. One of our jobs is to learn if these differ- 
ences are significant, or if they represent merely sampling variation. 
Another is to consider a way of averaging these coefficients. At the mo- 
ment, let us compute and plot in figure 12.2 the regression equations for 
the six lots. That for lot 1 is 

f = 100.0 + 0.3957 (X - 150.2) 

= 0.3957A' + 40.57, 

the means coming directly from the sums in table 12.1. We arc to learn 
if the slopes of these lines differ significantly. 



Fig. 12.2— Regressions of gain on food intake in six lots of rats. The average of these 
regressions is plotted in figure 12 1. 


A notable feature of table 12.6 is this: the totals in the three columns 
Sx 2 y Sxy , and Sy 2 are the same as the differences in the third line of table 
12.3. This fact furnishes a perfect verification of the calculations in the 
two tables. But, more important, it gives a clue to the character of the 
sums of squares and products within lots in the earlier table. It is now 
evident that they are the pooled values from all the groups, each deviation 
involved being measured from its own group mean. Hence, the within lot 
regression of table 12.3 is an average of the six lot regressions of table 12.6. 
For comparison, note that this regression, computed in section 12.4, is 
0.3532 grams per unit food intake, intermediate in value among the lot 
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regressions. Thus we have been successful in our quest for an average of 
the individual regressions of table 12.6. 

An interesting contrast appears from a comparison of the within lot 
sum of squares of errors of estimate in table 12.3 with the sum of those in 
the last-but-one column of table 12.6. They are put in juxtaposition in 
table 12.7. Those in table 12.6 are sums of squares of deviations from 


TABLE 12.7 

Analysis of Errors of Estimate From Average Regression Within Groups 


Source of Variation 

Degrees of 
Freedom 

Errors of Estimate 

Sum of Mean 

Squares Square 

Deviations from average (error) regression 
within lots, table 12.3 

53 

8,595 37 


Deviations from individual lot regressions, 
table 12.6 

48 

8,157 43 

169 9 

Differences among lot regressions 

5 

437 94 

87 6 


individual regressions in the several lots. According to the principle of 
least squares each of these sums is less than that from any other straight 
line; hence, each is less than the sum of squares of the deviations from the 
average regression of table 12.3. That explains why the second sum in 
tabic 12.7 is less than the first. It turns out that the difference between 
them, 437.94, with 5 degrees of freedom is associated with the variation 
of the lot regression coefficients The corresponding mean square, 87.6, 
is to be tested for significance. For the test, the mean square deviation 
from individual lot regressions is a proper comparison. This mean square 
is an average of that variation in gains not explained either by ration 
differences or by the individual regressions, analogous to discrepance in a 
randomized block experiment. It is therefore attributable to experimental 
error. When you attempt the test, you find that the mean square for 
differences among regressions is actually less than error. The test is quite 
unnecessary, the lot regressions being more nearly alike than would be 
expected half the time in sampling from a homogeneous population. 

We have thus reached the second of our immediate objectives, a test 
of the significance of the differences among the group regression co- 
efficients. In a sense, we got the trailer in front of the car, averaging the 
regression coefficients before we showed that they do in fact belong to a 
comrqon population. Since the average turns out to be legitimate, we 
may revise our opinions to some extent. We may properly look upon the 
average regression as the best available estimate of the population regres- 
sion and may leave behind the individual regressions as being merely 
items in the larger sample. For further testing we may, as in table 12.3, 
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use the mean square 162.2 as the estimate of error. Active interest in the 
individual regressions almost vanishes with proof that they do not differ 
significantly. 

Having arrived at such conclusions, we may consider quite properly 
an average of the correlation coefficients in table 12.6. Since they may be 
samples from a homogeneous population, their average is computed from 
the sums of squares and products associated with the average regression, 
the result being 0.5081, the same as may be got by section 7.6 This is 
larger than the experiment (total) correlation, 0,3631, chiefly because of 
eliminating from the denominator the large sum of squares attributable 
to variation of the lot means. 

This detailed examination into the individual lot regressions yields two 
more bits of information sometimes well worth while. We return to table 
12.3 and compute the regression averages for the ration means: 

1. r = 5, 520.97/vV>7, 662.68) (4,61 2.93) = 0.3125 

2. b = 5,520.97/67,662.68 = 0.08160 grams per unit food 

3. Sum of squares of errors of estimate 

= 4,612.93 - (5,520.97)767,662.68 = 4,162 44 

The small reduction in sum of squares shows that there is no pronounced 
trend. Lot mean gains do not increase regularly with mean food intake, 
but are decidedly erratic. The regression equation for lot means is 

f = 87.87 + 0.08160 (A' - 154.6) 

= 0.08160A’+ 75.25 

This equation may be plotted in figure 12.1 for contrast with the average 
regression and the lot means. It is easy to see, however, that the lot means 
follow this regression no better than that within lots. A formal test is 
supplied in table 12.8. Here is entered a sum of squares of errors of 
estimate computed in each of the first three lines of table 12.3. The mean 
square for lot means is highly significant as compared with the average 
regression within lots. This verifies the conclusion reached above from an 
examination of the reduction in sum of squares. The variation in these 
lot mean gains is independent of their regression. 


TABLE 12.8 

Analysis of Errors of Estimate From Three Regressions 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Total from table 12,3 

58 

14,063 7 


Lot means 

4 

4,162 4 

1,040 6** 

Average within lots, table 12.3 

53 

8,595 4 

162 2 

Remainder 

1 

1,305.9 

1,305.9** 
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There remains a single degree of freedom in table 12.8, with an asso- 
ciated mean square, 1,305.9, that occasionally has an interesting interpre- 
tation. It represents the difference between two regressions, that of lot 
means (0.0816) and the average within lots (0.3532). If the mean square 
in the second line of the table had been nonsignificant, this difference 
might have had a clear meaning. Its significance would have indicated 
that the gains of individuals within the groups followed a trend different 
from that of group means. In the present experiment where there is no 
pronounced trend of means, it is idle to bother about the difference be- 
tween this and any other trend. Only if there are well defined trends both 
among individuals within lots and among lot means is this degree of 
freedom likely to yield interpretable information. 


EXAMPLE 12.4 — The analysis of the errors of estimate for example 12 1 is: 



Degrees of 

Mean 

Source of Variation 

Freedom 

Square 

Deviations from individual regressions 

12 

16 32 

Regression coefficients 

3 

18 40 

Group means from their own regression 

2 

10 60 

Remainder 

i 

1 

901.86 


For an explanation, represent graphically the original data. 


EXAMPLE 12.5 -“Complete the covariance analysis of the lots of pigs in example 
12.2 The results are: 


.Source of Variation 


Within lots, pooled regression 
Within lots, individual lot regressions 

Lot regression coefficients 
Lot means 

Difference between regtessions within lots 
and between lot means 


Degrees of Mean Square 

Freedom Errors of Estimate 


35 0 0230 

32 0.0235 

3 0 0179 

2 0 6248** 

1 0.0861 


This indicates uniformity of behavior in all parts of the experiment save only in lot 
means Their differences are evidently the result of the rations, being unexplained 
by variations in initial weight. This is merely additional information — the proper test 
is that of example 12.2. 

EXAMPLE 12.6 — For the pigs compute the adjusted lot means, 1.28, 1.67, 1.20, 
1.50. 

EXAMPLE 12.7 — Calculate the regression within lots of pigs, 
f - 0.00796* + 0 997, 

and plot it on the same graph with the lot means, showing the manner in which these 
means depart from regression. 
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12.6 — Covariance in two groups only. If there are only two groups 
in an experiment, either or both of two questions may be asked : are there 
significant differences first, between the two adjusted group means, and 
second, between the two group regression coefficients? This is merely a 
special case of the method developed in the last section but interesting 
enough to be illustrated. We turn again to the rat data with the addi- 
tional information that the first three lots were fed rations high in protein 
contrasted with a low level in the last three. From table 12.1 it is easy to 
accumulate the data of table 12.9. For the experiment as a whole the 
original corrections for mean are used, but new ones must be calculated 

TABLE 12.9 

Data on Food Intake and Gain in Two Groups of the Rats of Table 12.1 Fed 

Two Levels of Protein 


Level 

SX 

SY 

SX 2 

SXY 

SY 1 

High 

4,719 

2,854 

797,549 

458 ,514 

277,956 

Low 

4,556 

2,418 

727,844 

370,437 

201,476 

Total 

9,275 

5,272 

1,525,393 

828,951 

479,432 


for the two levels. Corrected sums of squares and products are recorded 
in table 12.10. The various coefficients and sums of squares of errors of 
estimate are computed as for table 12.6. From these data the two desired 
tests are made in tables 12.11 and 12.12. The adjusted mean gains at the 

TABLE 12 10 

Regression and Correlation D*ta in Two Groups of Rats 


Level of 
Protein 

De- 

grees 

of 

Free- 

dom 

Sums of Squares and Products 

Corre- Regres- 
sion sion 

CocfTi- Coeffi- 
cient 1 cient 2 

Errors of 
Estimate 

Sx 2 Sxy Sy 2 

De- 

grees 

of 

Sum of Free- 
Squares 3 dorn 

High 

Low 

29 j 
29 

55,250 30 9,579 80 6,445 47 
35,939 47 3,223 40 6,585 20 

0 5076 0 1734 
0 2095 0 0897 

4,784 44 28 
6,296 09 28 

Sum 




11,080 53 56 

Average 

within 

levels 

t 58 

i i 

91,189 77 12,803 20 13,030 67-0 3714 0 1404 

11,233.08 57 


Ljg y __ % jx, _ (s xy y 

V&*)W) Sx * y s* 
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TABLE 12.11 

Test of Significance of Difference Between Adjusted Mean Gains 
at Two Levels of Protein 



Errors of Estimate 


Degrees of 

Sum of 

Mean 

Source of Vaiiation 

Freedom 

Squares 

Square 

Total from table 12.3 

58 

14,063 72 


Average within levels 

57 

11,233 08 

197 

Between adjusted means 

1 

2,830 64 

2,831** 


Adjusted mean gains: high level, 94.7, low level, 81.0 


two levels of protein intake differ significantly, but the regressions are 
remarkably similar. 

A striking contrast between tables 12.12 and 12.8 is the absence of the 
two sources of variation, lot means and remainder. The remainder is, in fact, 
the single degree of freedom between regression coefficients in table 12.12. 
There can be no deviation from the regression of means — since there are 
only two of them they determine the two points that fix the regression. 
There is neither degree of freedom nor sum of squares for testing this 
regression. 

EXAMPLE 12.8— Compute the regression equations for the two levels: 

high: f - 0 1734* + 67 82 
low: f - 0 0897 A” + 66.97 

Plot these on a graph, then plot the six lot means on the same graph. 

12.7 — Covariance in a randomized block experiment. When it is 
necessary to use statistical control in a randomized block experiment, the 
test of significance of the differences among the adjusted means is easily 
made. In table 12.13 are data from an experiment of this type. The 


TABLE 12.12 

Ti si of Significance Between Regression Coefficients at High and 
Low Levels of Protein 


-- . 

- - — - 

- — - . 

--- ~ 



Errors of Estimate 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Average within levels 

Deviations from individual 

57 

11,233 


level regressions 

56 

11,081 

198 

Between regression coefficients 

1 

i 

152 

152 
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TABLE 12.13 

Numbers of Beets per Plot and Yields (Tons per Acre) in a 
Randomized Block Experiment 


Fertilizer 

Applied 

Number 

and 

Yields 

Block 

Treat- 

ment 

Sums 

1 

2 

3 

4 

5 

6 

None 

Number 

Yield 

183 

2.45 

176 

2.25 

291 

4 38 

254 

4 35 

225 

3.42 

249 

3.27 

1,378 
20 12 

Superphos- 
phate, P 

Number 

Yield 

356 

6 71 

300 

5 44 

301 

4 92 

271 

5 23 

288 

6 74 

258 

4.74 

1,774 
33 78 

Muriate of 
potash, K 

Number 

Yield 

224 

3 22 

258 

4 14 

244 

2 32 

217 

4.42 

192 

3.28 

236 

4 00 

1,371 

21 38 

P + K 

Number 

Yield 

329 

6 34 

283 

5 44 

308 

5 22 

326 

8 00 

318 
6 96 

318 

6 96 

1,882 
38 92 

P + sodium 
nitrate, N 

Number 

Yield 

371 

6 48 

354 

7 11 

352 

5 88 

331 

7 54 

290 
6 61 

410 

8 86 

2.108 
42 48 

K + N 

Number 

Yield 

230 

3 70 

221 

3 24 

237 

2 82 

193 

2 15 

247 

5 19 

250 

4 13 

1 ,378 
21 23 

P + K + N 

Number 

Yield 

322 

6 10 

367 

7 68 

400 

7 37 

333 

7 83 

314 

7.75 

385 

7.39 

2,121 
44 12 

Block sums 

■ 

Number 

Yield 

2,015 
35 00 

1,959 
35 30 

2,133 
32 91 

1,925 
39 52 

1,874 
39 95 

2,106 

39.35 

12,012 
222 03 


object of the investigation was to learn the effect of certain fertilizers on 
the yield of sugar beets. But the number of plants differed from plot to 
plot. It was necessary, therefore, to examine the effect of the variable 
stand and to estimate the yields on the basis of equal numbers of beets. 
For the two variates, number of beets A r and yield F, the sums of the 
recorded values arc entered in the table. For a complete check, these 
should be run up for both treatments and blocks. From the experiment 
sums the three corrections for mean are calculated: 


For X: 3,435,432 
For F: 1,173.7457 
For AT: 63,500.58 


From the experiment sums of squares and products, the values of 
Sxy> and »SV 2 are computed and entered in the first line of table 12.14. The 
corresponding results for blocks in line 2 are: 


Sx* - 


(2,015) 2 + . . . + (2,106) 2 


3,435,432 - 7,472.57 


7 
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(35.00) 2 + . . . + (39. 35) 2 

Sf = - 1,173.7457 = 6.3134 


% - _ 63,500.58 - —116.56 


The third line, treatments, comes from similar computations applied to 
the treatment sums, the denominator now being 6. the number of items 
entering into each sum. Next the numbers in lines 2 and 3 are deducted 
from the line of totals. The remainder corresponds to the variation com- 
monly ascribed to error, the discrepance among the adjusted plot yields. 


TABLE 12.14 

Analysis of Covariance of Stand and Yield of Sugar Beets, Together 
With Test of Significance 



I 

Sums of Sq 

uares and Products 

Errors of Estimate 


Degrees 




Degrees 

Source of 

of 




Sum of of Mean 

Variation 

Freedom 

[ A'** 

Sxy 

Sr* 

Squares Freedom Square 

Total 

41 

152,158 00 

4,163 69 

142.4022! 


Blocks 

5 

7,472 57 

— 116.56 

6 3134; 


Treatments 

6 

116,020 33 

3,598 05 

112 8562 


Error 

30 

28,665 10 

682 20 

23 2326 

(, 9969 29 0 241 

Treatment 






-f error 

36 

144,685 43 

4,280 25 

136.0888 

9 4655 35 


Difference for testing adjusted treatment means 2 4686 6 0 41 1 


F « 0 411/0 241 = 1.71, Pn - 2.43 


Let us pause to analyze the variance of the yields as in table 12.15. 
The block differences appear nonsignificant, apparently indicating un- 
usually uniform field conditions. The yields, however, varied decidedly 
with treatment. We wish lo know how these yields are related to stand. 

Returning to table 12.14, the next step is different from any taken 
before. Since the total in line 1 has been subdivided into more than two 


TABLE 12 15 

Analysis of Variance of Yields Y of Sugar Beets With Seven Treatments 
Data taken from table 12.14 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Blocks 

5 

6 3134 

1 26 

Treatments 

6 

112 8562 

18 81** 

Error 

30 

23 2326 

0.774 
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parts, it is necessary to form a new total containing only the two effects 
we are interested in — treatment and error. This new total is in the line, 
treatment + error , degrees of freedom, sums of squares and products all 
being added to get the appropriate numbers. The remainder of the test- 
ing is like that of table 12.3. Sums of squares of errors of estimate are 
calculated by use of the usual formula in lines error and treatment + error. 
The former is subtracted from the latter to yield the sum of squares, 
2.4686, appropriate for testing significance. You see that the adjusted 
treatment means do not differ significantly. When this result is compared 
with that of table 12.15, it is clear that the yield differences that seemed 
to be due to treatment were largely the result of differences in stand. The 
average weight of the individual beets wasn’t greatly affected by the 
fertilizers. 

There is the possibility that the differences in stand were themselves 
the result of the treatments. For evidence, it is worth completing the 
analysis of the variance of numbers of beets, A r , as is done in table 12.16. 

TABLE 12 16 

Analysis of Variance of Numbers of Planis per Plot in Sugar Beet Experiment 
Data taken from table 12.14 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Blocks 

5 

7,472 57 

1,495 

Treatments 

6 

116,020 33 

19,337 

Error 

30 

28,665 10 

956 


Two pertinent facts appear. The block stands did not differ signifi- 
cantly, indicating that the differential survival of the plants was not a soil 
characteristic. On the other hand, the stands in the plots differed sig- 
nificantly with treatment. We thus come back to the starting point of our 
investigation; the yields are affected by treatment, but we now have the 
valuable information that the result is brought about chiefly by treatment 
effects on stand rather than on individual beet weight. 

Another question of interest, referred to above, is this: did the' treat- 
ments affect the weight of the individual beets? If you divide each yield 
in table 12.13 by the corresponding number of plants, then analyze the 
results, you will get highly significant differences among the treatment 
means, most of it associated with the contrast between the plots receiving 
superphosphate and those without. (See, also, example 12.10.) Further- 
more, a covariance analysis of mean beet weight adjusted for number per plot 
shows significant differences, again attributable largely to superphosphate. 
There is little doubt that in this experiment the superior yields on the 
plots with superphosphate are attributable to both numbers and weights 
of the plants produced, yield being the product of the two. 

Distinction should be drawn sharply between the uses of covariance 
analysis in the two situations where the treatment mean X’s do and do not 
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differ significantly. The latter circumstance characterized the pig initial 
weights of example 12.3, the object of the covariance analysis being to 
gain increased precision in the test of significance. Significant differences 
among the means of the independent variable, found in both the beet 
numbers and the food intake of section 12 3, focus attention on the causes 
and conditions of the variations in Y. Did the rats gain differently because 
of the differences in food intake, or in spite of them? If one assumes equality 
in stand among plots of beets, would the yields (individual beet weights) 
still give evidence of being different? Physiological questions then come 
up, such as the balance between competition and food supply in the plots 
with varying stands. 

In examining the effect of stand on yield in maize, as another example, 
significant differences in stand may be traced to differential viability in 
seed (associated perhaps with variety) or to experimental methods of 
planting. An investigation of the latter kind may not require covariance 
at all, the knowledge of differences in stand being the object of the experi- 
ment; but the physiologist likes to know if decreased stand was compen- 
sated by increased yield per plant. The problem of viable seed will interest 
the breeder and dealer as well as the farmer. Some part of the covariance 
analysis contains information for each. 

The reader will be well repaid by a critical examination of the nature 
of the data in the line for error , table 12.14. Of what errors of estimate, for 
example, is 6.9969 the sum of squares? The error regression equation of 
the foregoing paragraph might be used for calculating 42 errors of estimate, 
the sum of whose squares is actually 6.9969. Yet little light would be shed 
on the fundamental character of the error regression. This equation 
really relates yield to stand independently of both block and treatment. 
Let us look at the mechanism. But first refresh your memory of expected 
values in section 11.3. Following those methods one could supply ex- 
pected numbers of beets and also expected yields in each of the 42 cells 
of table 12.13. These would lead to two complete sets of deviations from 
expected values. It is these deviations that supply the key to the meaning 
of the error regression statistics. They are independent of block and treat- 
ment, being the remainders from those very expected values which are 
determined by block and treatment constants. If one correlates the 42 
pairs of deviations in the manner of chapter 7, he will get the same result 
as may be got from table 12.14, 

682.20/\/(28, 665. 10)(23. 2326) = 0.8360 
The regression of the yield deviation on the number-of-beets deviation is 
the same as 682.20/28,665.10 = 0.0238. 

EXAMPLE 12.9 — Test the significance of the differences among adjusted block 
yields, proceeding exactly as in table 12 14 with this exception — make up the new sum 
by adding block and error figures. The F value is 11.4, highly significant. How do you 
interpret this result? Examine the adjusted block means. Note the negative correla- 
tion in table 12.14. 

EXAMPLE 12.10— It was suggested that superphosphate might have a fungicidal 
action preventing the loss of the young plants, or that it might stimulate rapid growth 
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during the susceptible stage enabling the seedlings to escape infection. In the experi- 
mental results of table 12.13 is there any evidence about this question? It is easy to 
test the significance of the difference between adjusted means of the yields of plots 
with and without superphosphate. First, analyze the 6 degrees of freedom for treatment 
into two parts, the first having the one degree of freedom representing the contrast 
between plots with superphosphate and those without, while the other five are those 
within the groups of plots just contrasted. The sums and sums of squares and products 
of observations for the four plots with superphosphate and for the three lacking it are 
readily made up from table 12.13. You are then ready to compute this table: 


Source of Variation 

Degrees of 
Freedom 

Sx* 

Sxy 

y 

Treatments (table 12 14) 

6 

116,020 33 

3,598 05 

112 8562 

P vs. none 

1 

101,348 43 

3,218.71 

102 2221 

Lots in the same group 

5 

14,671 90 

379 34 

10 6341 


This table shows that a large part of the variation in both stand and yield is between 
the two groups having and lacking superphosphate. It is especially interesting to test 
the significance of the difference between the adjusted means of these two groups: 


Source of Variation 

S* 2 


Sxy 

y 

Errors of Estimate 

Sum of Degrees of Mean 
Squares Freedom Square 

P vs. none 

101,348 

43 

3,218 71 

102 2221 


1 

0.241 

Error (table 12.14) 

28,665 

10 

682 20 

23 2326 

6 9969 

29 

0 241 

Sum 

130,013 

53 

3,900 91 

125 4547 

8 4123 

30 


Difference for testing significance 

1 4154 

1 

1.415* 


Adjusted mean yields: P , 5 626; None, 4.834 


This shows that the superphosphate produced a significantly larger yield after adjust- 
ment for stand. This effect is small, however, compared with the differences in yield 
due to stand. In this comparison there is a defect that will be remedied in example 
15.12. The conclusions are unaltered. 

EXAMPLE 12 11 — The following data are adapted from those of Crampton (1) 
in such fashion that the means, the error correlation, and the tests of significance are 
little changed. X is food consumed, decreased by 300 pounds for each pig. Y is gain 
in weight, less 50 pounds. The pigs were assigned to the lots in groups of three, the 
three pigs in each group being litter mates of approximately equal weights. 


Group 

Lot 

1 

2 


5 

A' 

Y 

X 

r 

X 

Y 

1 

84 

17 

77 

38 

127 

38 

2 

16 

7 

103 

45 

73 

29 

3 

60 

34 

60 

30 

57 

35 

4 

89 

37 

52 

46 

48 

40 

5 

21 

10 

13 

21 

80 

23 
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By supplying expected values, verify the fact that the error correlation is 0.7312. 
Verify this also by analysis of covariance. Test the significance of group differences 
after adjustment for food consumed, F * 4 . 66 , df . * 4 and 7; also of adjusted treat- 
ment differences, F » 6.30, d.f. » 2 and 7. 

12.8 — Covariance in a latin square. The foregoing covariance 
methods are readily applied when two variates are measured in a latin 
square experiment. For illustration we have taken the data in table 12.17 
from the experiment on millet (2) partly analyzed in table 11.11. The 
additional data on stooling are now introduced to learn if the number of 
culms affects the yield. The sums of squares and products are listed in 
table 12.18. The only new feature is the presence of both rows and col- 
umns instead of only blocks. The data for the computation of correspond- 
ing sums of squares are taken from the column and row of sums, table 
12.17. The sums of squares and products for spacings are calculated from 
the sums, SX and SY , in the appendix at the bottom of the table. 

TABLE 12.17 

Mean Number of Culms per Plant, A\ and Yield (Grams), F, of Millet in the 
Piots of a Latin Square 

The five spacings are denoted by (2), (4), (6), (8), and (10) inches. 



No. X 



Column 



Sums 

Row 

or 

Yield Y 

1 

2 

3 

4 

5 

oA 

SY 

1 

X 

(4) 3 77 

(10) 4 02 

(2) 3 50 

(6) 3 97 

(8) 3 60 

18 86 


Y 

257 

230 

279 

287 

202 

1255 

2 

X 

(8) 4 03: 

(2) 3 00 

(10) 5 37 

(4) 4.02 

(6) 3 60: 

20 02 


Y 

245 

, 

283 

245 

280 

260 

1313 

3 

X 

(10) 4 02 

(4) 3.00 

, 

(6) 3 02 

(8) 3.73 

(2) 2.90 

16 67 


Y 

182 

252 

280 

246 

250 

1210 

4 1 

X 

(2) 2 60 

(6) 4 33 

(8) 3 30 

(10) 3 33 

1(4) 3.33 

16 89 


Y \ 

203 

204 

227 

193 

259 

1086 

5 

\ 

X 

(6) 3 83 

(8) 4 03 

(4) 4 00 

(2) 2 47 

(10) 5 53 

19 86 


Y 

231 

271 

266 

334 

338 

1440 

Sum 

SX 

18 25 

18 38 

19 19 

17 52 

18 96 

92 30 


SY 

1118 

1240 

1297 

1340 

1309 

6304 


SX 1 = 353 3144 SXY = 23,326 36 SY 2 = 1,626,188 




Spacing Sums and Means 



(2) 

(4) (6) 

(8) 

(10) 

Sums 

SX 

14 47 

18 12 18 75 

18 69 

22 27 


SY 

1,349 

1,314 1,262 

1,191 

1,188 

Means i 

X \ 

2 89 

3 62 3 75 

3 74 

4 45 


Y 

269 8 

262 8 252 4 

238 2 

237 6 
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TABLE 12.18 

Covariance in a Latin Square Experiment on the Effects of Spacing Upon the 
Stooung and Yield of Millet 


Source of 
Variation 

Degrees 

of 

Freedom 

Sums of Squares and Products 

Errors of Estimate 

S# 

Sxy 

sy 

Degrees 

Sum of of Mean 

Squares Freedom Square 

Total 

24 

12 5428 

51.99 

36,571 36 



Rows 

4 

2.0445 

139 07 

13,601 36 



Columns 

4 

0 3434 

1.54 

6,146 16 



Spacings 

4 

6.1378 

-132 62 

4,156 56 

: 


Error 

12 

4.0171 

44 00 

12,667 28 

12,185 34 

11 1,107.8 

Spacing 







plus error 

16 

10.1549 

- 88.62 

16,823 84 

16,050 47 

15 


Difference for testing significance of spacing 3,865 13 4 966.3 


Since we are interested only in the effect of spacing on yield per stool, 
we make the test of significance at the bottom of table 12.18 The ad- 
justed means vary actually less than one might expect in 50% of trials 
from a homogeneous population. It makes no difference, then, whether 
the adjusted or unadjusted yields are tested, the differences are not 
significant. 

You may raise a pertinent question at this point if you haven’t done so 
before. Why should yield be adjusted for number of culms? If close 
planting were found to give significantly greater yields, what difference 
would it make if the number of culms were less? The physiologist and the 
producer might differ in their answers to the question. Whether to adjust 
for number of culms or not depends upon the information desired. It is 
for the investigator to decide what question he is proposing. Don’t use 
covariance unless there is a well-considered reason for doing so. 

EXAMPLE 12.12 — Test the significance of the differences among row yields of 
millet. You will get F — 3.22, df— 4 and 1 2. Then test the significance of differences 
among adjusted row yields. Your test will be based on F = 1.93, d.j. = 4 and 11. 

EXAMPLE 12.13 — In the manner of example 11.26, insert in table 1217 expected 
values for both culms and yields. Make a table of 25 pairs of deviations from expected 
Calculate r, then verify by use of the regression statistics in the line for error, table 12.18. 
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Chapter 13 

MULTIPLE REGRESSION AND COVARIANCE 


13.1 — Introduction. Drawing together the threads of the foregoing 
methods, multiple regression completes a fabric of increasingly complex 
patterns. Starting with a unique sample, the number of variates and the 
number of groups have been alternately expanded. We are now ready 
to make another extension, increasing the number of variates beyond 
two. Since three or more characters are often measured, their mutual 
relationships must be evaluated. At present three variates in a single 
group will be considered. Afterwards multiple groups will receive atten- 
tion. Four and more variates offer no new principles, only increasing the 
extent and intricacy of the calculations. 

13.2 — Three variates in a single group. The increase from two 
variates to three adds a surprising number of new concepts to those 
developed in chapters 6 and 7. Perhaps the most striking is the necessity 
for three dimensions in which to depict the variates graphically. Solid 
geometry is required instead of plane. If Y partly depends upon A'i and 
X 2 for its value, three mutually perpendicular axes are demanded for 
graphical presentation. is measured along one, X 2 is laid out parallel 
to the second and Y rises from the plane X\X* parallel to the third axis. 
The points in space fixed by the triplets of values (AT A' 2 , Y) determine 
a regression plane whose equation is 

^ = b + by 1-2 X\ + b Y 2.\ AT 

byi .2 and b Y 2.1 being the partial regression coefficients . The first is read, “the 
regression of Y on X\ independent of AV” 

This leads to a second complication. Among the three variates there 
are three correlation coefficients, r Yh r and ri 2 *t How may these be 
combined into two regression coefficients? The principle of least squares 
is again invoked. The regression plane is designed to pass among the 
points so that the sum of squares of vertical distances from plane to points 
shall be a minimum — the least, that is, for any plane surface. Curvilinear 
regression is not considered in this chapter. 

The methods will be illustrated by use of the data on pigs, table 13.1. 
These animals were kept on adequate rations during the period of ap- 
proximately uniform growth rate. We wish to evaluate the relationships 

f The correlation method will be described first. Beginning with section 13.11, the al- 
ternative direct method will be presented. 
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among initial weight, initial age and average daily gain, and to inquire 
the amount of information about rate of gain which is furnished in ad- 
vance by the two independent variates. 

The computations are conveniently arranged as in table 13.2. The 
sums of the 40 items, the sums of their squares and of their products 
(lines 1) arc entered in the table from the dials of the calculating machine. 
In lines 2 are the usual correction terms. As illustrations, in line ^Yi2, 


TABLE 13.1 

Initial Age (AT i ) , Initial Weight ( X 2 ), and Rate of Gain (F) of 40 Swine 
Growing Lmformly 


Fig No. 

Initial 
Age Xi 
(days) 

Weight X, 
(pounds) 

Rate of 

Gain V 
(pounds 
per day) 

Estimated 
Rate P 

Error of 
Estimate 

Y -? 

1 

78 

61 

1 40 

1 45 

- 05 

2 

90 

59 

1 79 

1 39 

40 

3 

94 

76 

1 72 

1 50 

.22 

4 

71 

50 

1 47 

1 39 

08 

5 

1 99 

61 

1 26 

1 38 

-.12 

6 

80 

54 

1 28 

1 39 

- 11 

7 

83 

57 

1 34 

1 40 

- 06 

8 

75 

45 

1 55 

1 34 

21 

9 

62 

41 

1 57 

1 35 

.22 

10 

67 

40 

1 26 

1 33 

- 07 

11 

78 

74 

1 61 

1 54 

.07 

12 

99 

75 

1 31 

1 48 

- 17 

13 

80 

64 

1 12 

1 46 

- 34 

14 

75 

48 

1 35 

1 36 

- 01 

15 

94 

62 

1 29 

1 40 

-.11 

16 

91 

42 

1 24 

1 26 

- 02 

17 

75 

52 

1 29 

1 39 

- 10 

18 

63 

43 

1 43 

1 36 

07 

19 

62 

50 

1 29 

1 .42 

-.13 

20 

67 

40 

1 26 

1 33 

- 07 

21 

78 

80 

1 67 

1 59 

08 

22 

83 

61 

1 41 

1 43 

- 02 

23 

79 

62 

1 73 

i 45 

28 

24 

70 

47 

1 23 

1 37 

- 14 

25 

85 

59 

1 49 

1 41 

.08 

26 

83 

42 

1 22 

1 29 

- 07 

27 

71 

47 

1 39 

1 37 , 

02 

28 

66 

42 

1 39 

1 35 

04 

29 

67 

40 

1 56 

1 33 

.23 

30 

67 

40 

1 36 

1.33 

03 

31 

77 

62 

1 40 

1.46 

- 06 

32 

71 

55 

1 47 

1 42 

05 

33 

78 

62 

1 37 

1 .45 

- 08 

34 

70 

43 

1 15 

1 34 

- 19 

35 

95 

57 

1 22 

1 36 

- 14 

36 

96 

51 

1 48 

1 .31 

17 

37 

71 

41 

1 31 

1 32 

- 01 

38 

63 

40 

1 27 

1 34 

- 07 

39 

62 

45 

1 22 

1 33 

- 16 

40 

67 

39 

1 36 

1 32 

04 
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TABLE 13.2 

Calculation of Sums of Squares and Products and Correlation 
Coefficients for Pig Data 


n = 40 

Xi 

A r 2 

Y 

Sums 

3,082 

2,109 

55 53 

Means 

77 050 

52.725 

1.3882 

A'i 1. SXf, S.WXi, SX, Y 

242,204 

165,536 

4,285 51 

2. Correction terms 

237,468.10 

162,498 45 

4,278.5865 

3 . Sxi 2 , 6 * 1 * 2 , Sxy 

4,735 90 

3,037 55 

6 9235 

4. V^i 2 , V (6*i 2 ) (Sx* 2 ), etc. 

68 8179 

4,898 16 

69 5983 

5. r 12 , Ty 1 


0 6201 

0 0995 

X% 1. SX 2 \ SXzY 


1 J 6,263 

2,955 36 

* 2. Correction terms 


111,197 02 

2,927 8192 

3. Sx 2 2 , Sx 2 y 


5,065.98 

27 5408 

4. vSx \/(Sx 2 2 ){Sy 2 ) 


71 1757 

71 9828 

5. r Y i 



0 3826 

Y 1. SY* 



78.1123 

2. Correction term 



77 0895 

3 . Sf 



1 0228 

4 . vV 



1 01134 

5. x„ 



0 162 


column Xi, (3,082) 2 /40 = 237,468.10; while in line X 2 2, column F, 
(2,1 09) (55.53)/ 40 = 2,927.8192. The differences in lines 3 are sums of 
squares and products of deviations from means. In each line 4 the first 
number is the square root of the sum of squares just above. The other 
numbers in these lines are products of these square roots. For example, 
in line X\4, column F, V (Sx{ 2 ) (S'y 2 ) = (68.81 79) (1 .01 1 34) = 69.5983. 
These products are the denominators of the correlation coefficients, the 
coefficients being calculated in the usual manner; as an example: 


3,037.55 

4,898.16 


0.6201 


13.3 — The multiple regression equation. Now comes the first new 
feature in the calculation of multiple regression. The application of the 
principle of least squares leads to a pair of simultaneous normal equations, 

^Vi-2 + ri 2 b' Y 2-i = r Y i 
T 12 b f yi'2 4 “ b'y2.1 = Ty 2 


in which b' Y i. 2 and b' Y2 .i represent the standard partial regression coefficients 
that will be used to compute the regression of Y on Aj and AV The 
^-primes are used in order to distinguish these standard regression co- 
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efficients from the 6’s of the equation in the preceding section. The rela- 
tion between the 6-primes and the 6’s will emerge presently. The symbol 
b'yi -2 indicates the standard regression of Y on X\ independent of X*. 

The solution of the normal equations yields the formulas, 


6Vl*2 


Ty 1 **“ r Y 2 f 12 
1 - M2 2 ’ 


b f Y<i>\ 


Ty 2 ” Tr 1 r V2 
1 ~ ri2 2 


Substituting the pig data, 

t , 0.0995 - (0.3826)(0.6201) 
r1 ' 2 ~ 1 - (0.6201 ) 2 


— 0.2239 


6' 


Y 2-1 


0.3826 - (0.0995) (0.6201) 
1 - (0.6201) 2 


= 0.5214 


If wc use standard measure (section 7.4), the regression equation is 

$ — 6Vi-2 x\ + b f y 2'\ xf 
= — 0.2239x/ + 0.5214* 2 ', 


where x\ = {X\ — x^)/s h etc. That is, the 6-primes are the regression 
coefficients if standard measure is used. Since the ordinary units of 
measurements must be returned to, the equation is usually given in some 
such form as this: 


= y + 6 / n.t(^Al)(^l ®l) + b f Y 2-l(s V /s2)(X2 — X 2 ) 


It is clear that the 6’s are computed from the 6-primes through multi- 
plication by the ratio of two standard deviations containing the desired 
units of measurements. Usually it is more convenient to calculate the 
equation, 

\/Vv 2 -v/cyi 

? = y + b'yi-2 — -~izr(Xi — Xi) + 6V2.1 — rr (A r 2 — £2) 

V^I 2 y/Sx 2 2 

Substituting the 6-primes and data from table 13.2, 

~ 1.01134 

f = 1.3882 - 0.2239 (A r i - 77.05) 

68.81 79 v 7 


1.01134 

+ 0.5214 __(.V 2 - 52.725) 

- 1.3882 - 0.003290(Ai - 77.05) + 0.007409(A r 2 - 52.725) 

= 1.3882 - 0.003290-Yi + 0.2535 + 0.007409A r 2 - 0.3906 
= 1.251 - 0.003290A r i + 0.007409A r 2 

The regression coefficients indicate that in this sample of pig data the 
average daily gain decreased 0.003290 pound per day with each day 
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increase in initial age, but increased 0.007409 pound per day with each 
pound increase of initial weight. Both coefficients specify changes in F 
quite independent of changes in the other independent variate. How- 
ever, one should not forget that age and weight tend to increase together 
(r 12 = 0.6201); hence, in changing from one individual to another, F 
usually gets both a positive and negative increment with a tendency to 
partially counterbalance. To see this, consider two pigs whose errors of 
estimate are small: 


Number 


Age 

Weight 

Average Daily Gain 

14 


75 

48 

1 35 

22 


83 

61 

1 41 


Since the second is 8 days older than the first, its rate of gain is expected 
to be (8) (0.003290) = 0.0263 pound less; but since it weighs 13 pounds 
more its rate is expected to be (13) (0.007409) = 0.0963 greater. These 
expected changes being concurrent, the net difference in rate should be 
0.0963 — 0.0263 = 0.07 pound to be compared with actual difference, 
1.41 — 1.35 — 0.06 pound. 

You will be interested in the positive correlation between age and 
rate, r Y \ — 0.0995, and the negative regression, b Y 1-2 — —0.003290. At 
first this is puzzling. In the sample as a whole gain increases slightly with 
initial age, but in groups having the same initial weight gains are smaller 
among the older pigs. It is the same kind of relation you might have 
found in the foregoing chapter if the total regression had been positive 
when that within groups was negative. An easy way to clarify thinking 
is to divide the pigs into weight classes, as in table 13.3, then treat them in 
the manner of covariance. If weight is ignored, the total correlation is 
r Y i = 0.0995, with total regression, 

? = 1.388 + 0.0995 1-01 - (A", - 77.05) = 0.001462A', + 1 275 

Within the weight classes, however, the regressions are predominantly 
negative (table 13.4), the average being —0.003347. (The small dis- 
crepancy between this value and that of the partial regression coefficient, 
— 0.003290, is due to the slightly different hypotheses involved.) For 
your thinking, then, you have the average regression of gain on age within 
weight classes, —0.003347, contrasted with the overall regression, 
0.001462. A graph similar to that of figure 12.2 will clarify the distinction. 
Putting all these ideas together, you will see readily that the regression of 
gain on age within the initial weight groups is the same kind of relation 
indicated by b Y 1 . 2 . It is independent of Xi because all the pigs of a group 
have approximately the same weight: it is an estimate of the regression 
in a population of pigs all starting with the same weight. In a similar 
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TABLE 13.3 

Data on 40 Swine (Table 13.1) Classified by Initial Weight 


Initial 

Number 




Weight 

of Swine 

Age and Average Daily Gain 

Total 

Mean 


- 

62 67 91 63 67 83 66 67 



39-44 

13 

1 57 1 26 1 24 1 43 1 26 1 22 1 39 1 56 





67 70 71 63 67 

904 

69 5 



1 36 1 15 1 31 1 27 1 36 

17 38 

1 34 



75 75 70 71 62 

353 

70 6 

45-49 

5 

1 55 1 35 1 23 1 39 1 22 ’ 

6 74 

1 35 



71 80 75 62 96 

384 

76 8 

50*54 

5 

1 47 1 28 1 29 1 29 1 48 

6 81 

1 36 



90 83 85 71 95 

424 

84 8 

55-59 

5 

1 79 1 34 1 49 1 47 1 22 

7 31 

1 46 



78 99 80 94 83 79 77 78 

668 

83 5 

60-64 

8 

1 40 1 26 1 12 1 29 1 41 1 73 1 40 1 37 

10 98 

1 37 



78 94 99 78 1 

349 

87 2 

74-80 

4 

1 61 1 72 1 31 1 67 

6 31 

1 58 




3,082 j 

77 05 

Total 

40 


55 53 

1 338 


manner, if the pigs were divided into groups having approximately the 
same initial age the average regression of gain on initial weight, inde- 
pendent of age, would be found to be about 0.007409 pound per pound 
of initial weight. By use of multiple regression we accomplish in a single 
equation the segregation of the independent contributions of X\ and X 2 
to the dependent variate Y . 

If you wish to get an idea of the relative potency of age and weight in 
estimating rate of gain, you must avoid comparing the ^-coefficients. 

TABLE 13.4 

Analysis of Covariance in Weight Classes of Swine 




Sums of Sqi 

uares and Products 


Weight 

Degrees of 




i 

Regression oi 

Class 

Freedom 

Sx? 

Sx iv 

•V 

Y on Xi 

39-44 

12 

831 2308 

-6 1885 

0 1917 j 

-0 007445 

45-49 

4 

113 2000 

2 0860 

0 0729 j 

0 018428 

50-54 

4 

634 8000 

2 5720 

0 0427 

0 004052 

55-59 

4 

324 8000 

-0 6480 

0.1819 ' 

-0 001995 

60-64 

7 

486 0000 

-3 6700 

0 2140 

-0 007551 

74-80 

3 

354 7500 

-3 3375 

0 1015 

— 0 . 009408 

Sum 

1 34 

2,744 7808 

— 9 1860 

0 8047 

| -0 003347 

1 



346 


Statistical Methods 


Section 13.4 


They are fractions with si in one denominator and S 2 in the other. If 
these two standard deviations were very different the two regression co- 
efficients would reflect this consequence of units of measurement. It is 
the & '-coefficients that should be compared. Look back at the regression 
equation in standard measure. If X\ and X 2 are normally distributed, 
then xi and * 2 ' are both estimates of normal deviates with the same range 
and with standard deviations equal to 1. Changes in * 1 ' and x% are 
reflected in # according to the values of b' Y i -2 = —0.2239 and b'y *- 1 
= 0.5214, respectively. Naturally, you can’t be too sure of population 
parameters from a comparison of these two statistics. They are subject 
to sampling variation. But in the sample itself the ^-primes give the infor- 
mation desired. Among the data of table 13.1, initial weight is more than 
twice as effective in forecasting as is initial age. 

EXAMPLE 13.1 — Here is a set of 20 triplets arranged for easy computation: 


Xi 

A 2 

r 

1 A', 

X t 

r 

29 

2 

22 

1 

21 

2 

24 

1 

4 

26 

12 

1 

7 

5 

3 

23 

24 

3 

23 

31 

1 

8 

16 

3 

28 

25 

3 

25 

6 

4 

25 

16 

1 

12 

20 

2 

22 

26 

1 

13 

35 

1 

25 

15 

4 

30 

9 

4 

32 

6 

2 

12 

19 

4 

37 

10 

3 

26 

14 

2 

20 

Calculate the regression, 

f « 

0.35Aq + 7.1 6 X 2 - 

- 1.9 



EXAMPLE 13.2 — In the foregoing example compare ryi — —0.1354 with 
byi . 2 *=* 0.3538. Make a table like 13 3 classifying the data in four groups having Xz 
equal respectively to 1, 2, 3, and 4. Except for the values of Xz the table will be the 
same as example 12 1 . Calculate the average regression of Y on X\ within the groups. 
You will get 0.3621 as a companion for 6 ^ 1*2 ~ 0.3538. 

EXAMPLE 13.3 — Compute the results recorded in table 13.4. 

13.4 — Errors of estimate. The uses of the multiple regression equa- 
tion are much the same as those described for the two- variable regression 
in chapter 6. One may estimate the rate of gain of each animal if he 
wishes. Pig number 1 is expected to gain 

t = 1.251 - (0.003290) (78) + (0.007409) (61) = 1.45 pounds per day 

Actually the gain was 1.40 pounds per day, the error of estimate being 
1.40 — 1.45 = —0.05 pound per day. These results are entered in the 
last two columns of table 13.1 . In accordance with the theory, the sum of 
the errors of estimate is zero, while the sum of their squares is the least 
possible for any linear regression. (Geometrically, linear regression in 
three dimensions represents a plane.) 

While one isn’t ordinarily much interested in the errors of estimate 
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for individual pigs, there are experiments where these data are the chief 
objective of the investigation. Let us see what kinds of information they 
contain. The one largest in absolute value is that for pig number 2. 
Being 3 months old with a weight of only 59 pounds, he was expected to 
gain only 1.39 pounds per day, but actually made the best gain of all, 1.79 
pounds per day. One might be much concerned to know why. One 
might also wish to compare the performance of this pig with that of num- 
ber 23. If no attention is paid to initial age and weight, the difference in 
rate of gain is only 1.79 — 1.73 = 0.06 pound per day. But that com- 
parison would be unfair to number 2 when X\ and X 2 are taken into ac- 
count. For his age and weight, number 2 was 0.40 pound per day better 
than average, whereas number 23 was only 0.28 better, a difference in 
favor of number 2 of 0.12 pound per day. Multiple regression enables 
one to evaluate individuals and to compare them on the basis of their 
deviations from average performance, due account being taken of known 
information contained in the independent variates. 

Recalling the facts of section 7.7, one may wish to know the correlation 
between Y and f T in table 13.1. It could be calculated in the manner of 
chapter 7, but a short cut is available. The icsult is known as the multiple 
correlation coefficient , and is arrived at by this formula: 

R 2 = r Y i b'ri '2 + Tyi ^Va-i 

= (0.0995) ( — 0.2239) + (0.3826)(0.5214) = 0.1772, 

whence R = 0.421 is the correlation one would get by the routine calcu- 
lation. This coefficient measures the success of estimating rate of gain 
from the two preliminary items of information. It is always less than 
unity, but greater than either of the coefficients r Y \ or r Y 2 , a fact which 
is sometimes valuable in detecting errors in calculation. 

The variation in the estimated values, 1, of table 13.1 is due to re- 
gression: if T were plotted against X\ and X 2 in a three-dimensional 
graph, every plotted point would lie in the regression plane. On the con- 
trary, the errors of estimate result from failure in estimating. In the 
graph they would be represented by line segments extending above or 
below the regression plane to the points represented by (Xi, X 2 , Y). In 
table 13.1 one could run up the sum of squares of deviations from mean in 
each of these last two columns. He would find them to' be respectively 
(see section 7.8 for an analogy): 

R 2 Sy 2 = (0.1772)(1.0228) = 0.1812 
(1 -R 2 )Sf = (1 — 0.1 772) (1 .0228) = 0.8416 

That is, the total sum of squares is separated into two parts, the first, 
0.1812, explained by regression, and the second, 0.8416, independent of 
regression. The latter is the sum of squares of errors of estimate, leading 
directly to the standard error of estimate , 

/Tl-0T772)(T:6228) aici 

*r-n = = J — = 0.151 pound per day 
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This measures variation among errors of estimate in the same way that s v 
measures variation in average daily gains. The introduction of the two 
control variates has resulted in a reduction of the unexplained variation 
from s y = 0.162 (table 13.2) to s Y .n = 0.151 pound per day. The degrees 
of freedom, n — 3, are less than n by the number of variates. Actually, 
three independent averages have been used in the regression, the mean of 
Y and the two partial regression coefficients. One degree of freedom is 
deducted for each. 

If the portions of information furnished by X\ and X 2 tend to be inde- 
pendent, r 12 is likely small and R may be much larger than either r yl or 
Ty 2 - In the pig data the facts arc otherwise. X x and X 2 are correlated 
rather strongly, age and weight being concurrent phenomena in young 
growing animals. It is not surprising, therefore, to find R = 0.421 little 
greater than r Y2 = 0.3826. Almost as good a job of estimating can be 
done with weight alone as independent variate. Age contributes little 
additional information. 

Occasionally one is not so much interested in the partitioning of Sjr 
into R 2 Sr 2 and (1 — R 2 )Sy* as in a corresponding division of the variance, 
s v 2 ~ — !)• For any number of variates, rn, this is done in table 

13.5. The mean square for deviations is expressed in terms of s 1 by substi- 


7 AISLE 13 5 

Analysis of Variancl in Ki.orlssion, m Varia'ils 


Source of 
Variation 

Degrees of 
Fleet loin 

Sum of 
Squares 

Mean 

Square 

Variance 

Exact n Large 

Total 

i 

n — 1 

s>* 

'Sf 

n — \ 

.f 2 j 2 

Regression 

m — 1 

r * s / 

AW 

fl -(1 /.’V 

L n-mj 

Deviations 

n — m 

1 

'5 

(1 -AW 

n — m 



tuting for Sy 2 its equal, ( n — 1)j 2 , giving the exact variance in the last 
line of the table. Subtraction from the total variance, s 2 , leaves the vari- 
ance attributable to regression in the second line. 

An adjusted value of R 2 , 

RJ = 1 - (1 - R*)— 

n — m 

simplifies the formulas for exact variance. In terms of R A 2 the total vari- 
ance, s 2 , is seen to be partitioned into the two fractions, 

R a 2 s\ attributed to regression, and 
(1 — R a 2 )s 2 , deviations from regression 
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In large samples the fraction, (n — \)/{n — m), differs little from 1; 
hence, R A 2 is approximately equal to R 2 . This is the origin of the not 
uncommon but approximate statement that the variance of the dependent 
variate is the sum of the two parts, R 2 s 2 due to regression and (1 — R 2 )s 2 , 
deviations from regression. 

EXAMPLE 13.4 — Calculate R » 0.8837 for the data of example 13.1. 

EXAMPLE 13.5 — Compute the regression of A'i on Y and X z in example 13.1 
li « 1.0289K - 10 162^ + 19.77, R - 0.7396 
Save your work sheets for use later. 

EXAMPLE 13.6 — Derive the two formulas: 

^ ryi 2 H- r \ 2* — I r YiTYiT xi 
1 — ria 2 

Rt * // yj . 2 2 + b ' yi x i 4. 2r„5W'r*i 

EXAMPLE 13.7 — Do you think it would be possible to get Ty\ — 0 6, ry 2 ~ 0.8, 
and fn = —0.5 from a set of experimental data.' 1 

EXAMPLE 13.8 —Starting with r Y i — 0.6 and r Yi 383 0.8, try the effects on R 
produced by various values of ri 2 . 

EXAMPLE 13.9 — Calculate R — 0.89 starting with r Yi — 0.21, ry 2 ~ 0.19, and 
r 12 = —0.9. Think of Xi as the temperature in a greenhouse, A' 2 , the relative humidity 
and Y the rate of growth of a plant. 

13.5 — Tests of significance. All of the statistics of multiple regression 
are subject to sampling variation, of course, so that measures of variation 
and tests of significance are in order. The standard errors of the two 
^-primes are identical, 

1 _ I r^(U772 _ 

(1 - r „»)(«- 3) “ V [1 - (0.6201 )*](37) ~ ° 19 ° 

Significance is tested by means of /; that is, the usual null hypothesis is 
set up and the probability of exceeding the sample value is determined. 
In our illustrative example, 

for b' ri . 2: t = 0.2239/0.190 = 1.18, 
while for b‘ Y 2 .j: t = 0.5214/0.190 = 2.74** 

The signs of the b - primes are ignored. For 37 d.f. the second is highly 
significant. 

Irrespective of the results from testing the ^-primes, the significance 
of the multiple regression as an entity may be tested. The test, identical 
with the test of the significance of R , is effected by using F to determine 
whether the mean square due to regression is significantly greater than 
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that unexplained by regression. The test is conveniently thrown into the 
form of analysis of variance: 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Regression, R*Sy 2 

2 

0 1812 

0.0906 

Errors of estimate, (1 — R^Sy 2 

37 

0 8416 

0.0227 

Total 

39 

1.0228 



F « 0.0906/0.0227 «* 3.99* 


For a more convenient test table 13.6 is entered for R (10) in the same 
manner as table 7.3 was for r. One merely has to look at the left for the 
degrees of freedom, n — 3, then read in the column for 3 variables the 
values of R at the 5% and 1% points. If the sample value is less than the 
5% point, evidence against the null hypothesis is considered to be too 
slight for significance. Values of R between the two points are called 
significant, while those greater than the 1% point are designated highly 
significant. In the illustrative example, R = 0.421, dj . = 37, lies between 
the 5% and 1% points. Such a value is not likely to arise in sampling 
from a population with zero R , but occasionally may. 

Naturally the results from the two methods which have been given for 
testing R must be identical. Use the one that is more available. For 
convenience, the 5% and 1% values of r copied from table 7.3, arc en- 
tered in the column for 2 variables, table 13.6. The other two columns 
will be explained later. 

As in chapter 6, one can now make fiducial statements about the mean, 
#, about the regression coefficients and about T or Y. For convenience, 
this topic will be deferred to section 13.11. 

13.6 — Multiple covariance in groups. Experimental data aren’t 
often taken in single groups like that of table 13.1. Animals are too 
expensive, time is too precious and too many treatments are awaiting 
trial. Usually some design like those in chapters 10 and 11 is used. These 
are all readily adapted to the necessities of three variables. Two examples 
will be worked out. 

As was mentioned earlier, the 40 swine of table 13.1 really constituted 
four lots whose rations and gains were not greatly different. We shall 
illustrate the simplest case of multiple covariance (more than two variates 
in two or more groups) by means of the four lots of pig data. We wish to 
test the significance of the differences among adjusted means in the 
manner of section 12.3. Since there are now three variates instead of two, 
the methods of multiple regression are required. 

If we were starting with a new sample of data we should first calculate 
the multiple regression of the entire set exactly as has been done in the 
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TABLE 13.6 

The 5% and 1% Points for r and R 


Degrees 

Number of Variables 

Degrees 

Number of Variables 

of 





of 













Freedom 

2 

3 

4 

5 

Freedom 

2 

3 

4 

5 

1 

.997 

999 

.999 

999 

24 

.388 

.470 

.523 

.562 


1.000 1 

000 1 

000 1 

.000 


.496 

.565 

.609 

.642 

2 

.950 

975 

.983 

.987 

25 

.381 

462 

.514 

553 


990 

.995 

.997 

.998 


.487 

.555 

.600 

633 

3 

878 

930 

950 

.961 

26 

374 

454 

506 

545 


.959 

.976 

.983 

.987 


.478 

546 

590 

.624 

4 

811 

881 

.912 

.930 

27 

.367 

.446 

.498 

.536 


917 

.949 

962 

.970 


.470 

538 

.582 

615 

5 

.754 

836 

.874 

.898 

28 

.361 

.439 

.490 

529 


874 

917 

937 

949 


.463 

.530 

.573 

.606 

6 

707 

795 

.839 

867 

29 

.355 

.432 

482 

521 


834 

886 

911 

.927 


.456 

.522 

.565 

.598 

7 

.666 

.758 

.807 

838 

30 

349 

.426 

476 

514 


.798 

855 

885 

904 


.449 

514 

558 

591 

8 

.632 

726 

.777 

811 

35 

325 

.397 

445 

.482 


765 

827 

860 

.882 


418 

481 

523 

556 

9 

602 

697 

750 

786 

40 

.304 

.373 

419 

455 


.735 

800 

.836 

.861 


.393 

.454 

.494 

.526 

10 

576 

671 

726 

763 

45 

288 

.353 

.397 

432 


708 

776 

814 

840 


372 

.430 

.470 

.501 

11 

553 

648 

703 

741 

50 

.273 

.336 

379 

.412 


684 

753 

.793 

821 


.354 

.410 

449 

479 

12 

532 

627 

683 

.722 

60 

.250 

308 

348 

.380 


661 

732 

773 

802 


.325 

.377 

414 

.442 

13 

514 

608 

.664 

.703 

70 

.232 

286 

.324 

354 


641 

712 

755 

785 


302 

351 

.386 

413 

14 j 

497 

. 590 

.646 

686 

80 

217 

269 

304 

.332 


623 

.694 

737 

768 


283 

330 

362 

389 

15 

482 

574 

630 

670 

90 

205 

254 

288 

315 


606 

677 

721 

752 


267 

.312 

343 

.368 

16 

468 

559 

615 

.655 

100 

.195 

241 

274 

300 


590 

.662 

.706 

.738 


254 

297 

.327 

.351 

17 

456 

545 

601 

641 

125 

174 

216 

246 

.269 


575 

647 

691 

724 


228 

266 

294 

316 

18 

444 

532 

587 

628 

150 

159 

198 

.225 

247 


561 

633 

678 

710 


.208 

.244 

.270 

.290 

19 

433 

520 

575 

615 

200 

138 

.172 

196 

215 


.549 

620 

665 

.698 


181 

212 

234 

253 

20 

423 

509 

. 563 

604 

300 

113 

141 

160 

176 


537 

608 

652 

685 


148 

174 

192 

208 

21 

413 

.498 

552 

592 

400 

098 

122 

139 

.153 


526 

596 

.641 

674 


.128 

151 

167 

.180 

22 

404 

488 

542 

582 

500 

088 

.109 

124 

.137 


515 

.585 

630 

.663 


115 

135 

.150 

.162 

23 

396 

479 

532 

572 

1000 

062 

.077 

088 

.097 


505 

574 

.619 

.652 


081 

.096 

.106 

.115 
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first part of this chapter. The sums of squares and products, table 13.2, 
are entered as total in the first lines of table 13.8, while the sum of squares 
of errors of estimate, (1 — R 2 )Sy 2 = 0.8416, is copied in the first line of 
table 13.9. 

Next, in order to segregate the effects of lots, we make a table of the 
lot sums, table 13.7. From these we calculate sums of squares and products 


TABLE 13.7 

Data for Four Lois of 10 Swine Each, Taken From Table 13.1 


Lot 

Pig 

Numbers 


Sums 


Initial 

Age 

Initial 

Weight 

Average 

Daily Gain 

1 

1-10 

799 

544 

14 64 

2 

11-20 

784 

550 

13 19 

3 

21-30 

749 

520 

14.45 

4 

31-40 

750 

495 

13 25 


for lots in the same way that table 13.2 was calculated for the individuals. 
The only difference is the use of the scheme of calculation already familiar 
in analysis of variance and covariance. As examples, the first entry /in 
line 2 of table 13.8 is 


(799) 2 + (784) 2 + (749) 2 + (750)2 

To ‘ 

- 237,665.80 - 237,468.10 = 187.70, 


while the second, column A' 2 , is 


(799) (544) + . . . + (750) (495) 

To 


- 162,498.45 = 160.15 


The correction terms for lines 2 are the same as those of table'll 3.2. 

The differences in lines 3 are the pooled sums of squares and products 
for individuals within lots, the first objective in this table. Two remarks 
are in order. First, the sums of squares and products might have been 
calculated directly for each lot, then pooled in the manner of table 12.6, 
but the use of lot sums is easier. Second, when there are only two subdivi- 
sions of the total, as in the present example, the correction terms need not 
be subtracted in lines 1 and 2. For example, Sx i 2 in line 3 might well ha\e 
been calculated from the uncorrected sums of squares, 

242,204 - 237,655.80 = 4,548.20 

We have followed the usual routine because of its familiarity and gener- 
ality. 
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TABLE 13.8 

Sums of Squares and Products for Total, for Lots, and Within Lots. Calculation 
of Correlation Coefficients Within Lots 



Source of Variation 

Age 

Xi 

Weight 

Xt 

Daily Gain 

Y 

A'i 

Total from table 13.2, line 3 
Lots 

4,735 90 
187.70 

3,037.55 

160 15 

6 9235 

1 3005 


Pigs within lots 

V-W, etc. 

ri2, ryi 

4,548 20 

67 4403 

2,877.40 
4,709 68 

0 6110 

5.6230 

62 0011 

0 0907 

-Y, 

Total from table 13.2, line 3 
Lots 


5,065.98 

189 08 

27 5408 
1.3218 


Pigs within lots 

rr 2 


4,876 90 
69.8348 

26 2190 
64.2025 

0 4084 

Y 

Total from table 1 3 2, line 3 
Lots 



1 0228 
0.1776 


Pigs within lots 

\/3? 



0 8452 

0 9193 


The second objective of table 13.8 is the computation of the three r’s 
from the pooled sums of squares and products within lots. These differ 
little from those of tabic 13.2 because the lot means are much alike. 

From the three r's within lots are now calculated V Y \* 2 = —0.2535, 
b\ 2 -i — 0.5633, and R 2 — 0 20706. Finally, the sum of squares of errors 
of estimate from this average regression within lots is 

(1 - 0.20706) (0.8452) = 0.6702, 

entered in line 2 of table 13.9. With the two sums of squares, total and 
error, the test of significance is made exactly as in table 12.3. The re- 

TABLE 13.9 

Analysis of Variance of Average Daily Gains and of Errors of Estimate 
of 40 Swine in 4 Lots 


Source of 
Variation 

Degrees of 
Freedom 

Ave. Daily Gain 

1 

R 2 

Errors of Estimate 

Sum of Mean 
Squares Square 

Degrees of Sum of Mean 
Freedom Squares Square 

Total 

39 

1 0228 

0.17722 

37 0 8416 

Error 

36 

0 8452 0 0235 

0.20706 

1 

34 0.6702 0 0197 

Lots 

3 

0 1776 0 0592 

1 

1 

1 

3 0 1714 0 0571 * 
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mainder, 0.1714 with d.f. = 3, is appropriate for testing the differences 
among lot mean rates of gain adjusted to average initial age and weight. 
The degrees of freedom in the first and second lines of the table are each 
less by two than the corresponding numbers available for analysis of 
variance. The deducted degrees of freedom correspond to the two partial 
regression coefficients used in these two lines. 

Without information about initial weight and age, differences among 
lot means would be tested (table 13.9, column 3) by 

F = 0.0592/0.0235 = 2.52, 

somewhat short of the 5% point, 2.86. The reduction in error variance 
from 0.0235 to 0.0197 together with adjustment of the means to a com- 
mon initial weight and gain have increased the precision of the experi- 
ment, and now 

F = 0.0571/0.0197 = 2.90, 
slightly beyond the 5% point. 


EXAMPLE 13.10 — Divide* the data of table 131 into 2 parts, as follows: 



Sum of Ages 

Sum of Weights 

Sum of Rates 

Lot3 

SXi 

SXt 

sr 

1 and 3 

1 1,548 

1,064 

29 . 09 

2 and 4 

| 1,534 

1,045 

26 44 


Test the significance of the difference between the two adjusted mean rates of gain. 
Ans. F « 8.70. 

13.7 — Multiple covariance with two criteria of classification. 

Three or more variates may be required to give complete information 
about individuals classified by two criteria. A typical example is the 
randomized block in which one may have data on stand, percentage of 
plants with ears, and yield, for example, in varieties of corn. The variety 
yields may then be compared on the basis of equal stand and bearing 
plants. The computation is little different from that already described. 

As illustration we select data from an interesting experiment (1) (11) 
designed to forecast the yield of wheat in Great Britain from measurements 
on the growing plant. In the present early stages of the investigation, 
it appears that most of the available information is contained in the two 
characters, shoot height at ear emergence (H) and plant numbers at 
tillering (N). Data for these variates together with yield are set out in 
table 13.10. 

The problem is two-fold. First, a regression equation is needed for 
forecasting the year’s yields. Second, the success in estimating must be 
gauged by applying the equation to the known data. 

The desired equation must be free from the effects of season and 
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TABLE 13.10 

Heights op Shoots at Ear Emeroence (H), Number of Plants at Tillering (JV), 
and Yield (F) of Wheat in Great Britain 
(//, inches; JV, number per foot; F* cwt. per acre) 




Place 




Seale 

Rotham- 

New- 

Bog- 

Sprows- 

Plump- 

Year 

Year 

Variate 

Hayne 

sted 

port 

hall 

ton 

ton 

sums 

1933 

H 

25,6 

25.4 

30 8 

33 0 

28 5 

28 0* 

171.3 


N 

14 9 

13 3 

4 6 

14.7 

12 8 

7 5 

67.8 


Y 

19 0 

22 2 

35 3 

32 8 

25 3 

35 8* 

170 4 

1934 

H 

25 4 

28 3 

35.3 

32 4 

25.9 

24 2 

171 5 


N 

7 2 

9 5 

6 8 

9 7 

9 2 

7 5 

49 9 


Y 

32.4 

32 2 

43.7 

35 7 

28 3 

35 2 

207.5 

1935 

H 

27.9 

34 4 

32 5 

27 5 

23.7 

32 9 i 

178.9 


N 

18 6 

22 2 

10 0* 

17.6 

14 4 

7 9 

90 7 


Y 

26 2 

34 7 

40 0 

29 6 

20 6 

47 2 j 

198 3 

Place 

H 

78 9 

88 1 

98 6 

92 9 

78 1 

85 1 

521.7 

Sums 

i JV 

40 7 

45 0 

21 4 

42.0 

36 4 

22 9 

208.4 


Y 

77 6 

89 1 

119 0 

98 1 

74 2 

118 2 

576 2 


* Estimated value. 


locality. From the total sums of squares and products, therefore, must be 
deducted amounts based on year and place sums. The final stages of 
the computation are recorded in table 13.11. For total , the calculations 
are based on the entire 54 records in table 13.10, ignoring both place 
and season. The method is identical with that described in section 13.2 
leading to lines 3 in tabic 13.2. 

The numbers in the second line opposite place arc derived from the 
place sums at the bottom of table 13.10, the calculation following the 
form usual in analysis of variance. For example, in line //, column JV, 

(78.9) (40.7) + . . . + (85.1)(22.9) (521 .7) (208.4) 

= —4706 

3 18 

The set of six correction terms is the same for total, place, and year. The 
entries opposite season are calculated similarly from the year sums. The 
remainders in the six cells of the table are the sums of squares and products 
for discrepance which, in an experiment of this type, are ascribed to error. 
These lead to the desired regression, independent of locality and season, 
recorded at the bottom of the table. 

The values of H and N for each place and year arc substituted suc- 
cessively in the regression equation, leading to the estimated yields of 
table 13.12. One notes that the yields in 1933 are less than average, 
while those in 1935 are more. All the Plumpton yields are greater than 
average. The job is notably well done, however, when the small amount 
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TABLE 13.11 

Analysis of Variance and Covariance in Wheat Data. Regression 
Statistics for Error 


Source of Variation 

Variate 

Sums of Squares and Products 

Height 

H 

Number 

N 

Yield 

Y 

Total 

H 

230.53 

- 0.65 

341 25 

Place 


106 34 

- 47 06 

190.83 

Season 


6 26 

26 24 

8 41 

Remainder (error) 


117 93 

20 17 

142.01 

Total 

N 


385 07 

-300.75 

Place 



171 46 

-257 03 

Season 



139 41 

- 22 26 

Remainder (error) 



74 20 

- 21 46 

Total 

Y 



982 30 

Place 




629 22 

Season 




124 42 

Remainder (error) 




228 66 


For remainder (error) 

r us — 0 2156, ruY — 0.8648, rsy — — 0.1647 
b' YH .s = 0.9442, b' Y s.H = -0.3683, R 2 - 0.8772 

f = 1 3147// - 0.6466A' + 1.393 





Place -f error 

H 

224 27 - 26 89 332 84 


N 

245 66 -278 49 


Y 

857 88 

For place 4- error, R 2 = 0.8494 





Errors of Estimate 



Yield 






Source of 

Sum of 


Degrees of 

Sum of 

Mean 

Variation 

Squares 

R} 

Freedom 

Sq uares 

Square 

Place -f- error 

857 88 

0 8494 

13 

129 20 


Error 

228.66 ' 

0 8772 

8 

28 08 

3 51 

Place 



5 

101 12 

20 22 


F = 5.76*, 


d f. ~ 5 and 8 
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TABLE 13.12 

Actual and Estimated Yields of Wheat 


Place 

1 

Ac- 

tual 

1933 



1934 


1935 

Sum 

Ex- 

pect- 

ed 

Errors 
of Es- 
timate 

Ac- 

tual 

Ex- 

pect- 

ed 

Errors 
of Es- 
timate 

Ac- 

tual 

Ex- 

pect- 

ed 

Errors 
of Es- 
timate 

Seale Hayne 

19 0 

25 4 

-6 4 

32.4 

30 1 

2 3 

26.2 

26 0 

0.2 


Rothamsted 

22 2 

26 2 

-4 0 

32 2 

32 5 

-0 3 

34 7 

32 3 

2.4 


Newport 

35.3 

38.9 

-3 6 

43 7 

43 4 

0 3 

40 0 

37.7 

2 3 


Boghall 

32 8 

35 3 

-2 5 

35 7 

37 7 

-2 0 

29 6 

26 2 

3 4 


Sprowston 

25 3 

30 6 

-5 3 

28 3 

29 5 

-1 2 

20 6 

23 2 

-2 6 


Plumpton 

35 8 

33 4 

2 4 

35 2 

28 4 

6 8 

47 2 

39 5 

7 7 


Sums 

1 

! 

-19 4 

1 

5 9 

13.4 

-0 1 


of data is considered. With the accumulation of information and experi- 
ence there is hope for even closer estimates. 

One should ponder the fundamental character of the regression equa- 
tion for error. It forms a substratum on which are built the variations in 
yield with place and season. It has been used heretofore for adjusting 
means and for testing significance. It constitutes an estimate of the regres- 
sion of Y on the independent variates freed from the controlled variation 
in column and row sums. 

In other experiments similar to this there may be occasion for tests of 
significance. The proper method is illustrated by testing the effect of 
place on yield of wheat adjusted for height and plant number. In the 
lower part of table 13.11 are entered the two additional steps in the test. 
First, the corresponding sums of squares and products in place and error 
having been added, the new R 2 = 0.8494 is calculated. The test of signifi- 
cance is then made in the usual manner. 


13.8 — Partial correlation. Related to partial regression in somewhat 
the same way that r is related to b in total regression (section 7.4) is the 
partial cor relation. Among the variables A'!, A* and A r 3 , ri 2 .3 is an estimate 
of the correlation between A\ and .Y 2 in a population whose members all 
have the same A' 3 . More realistically it is a kind of average of the correla- 
tions in groups of individuals classified on the basis of A r 3 . For example, 
the partial correlation between age and gain independent of initial weight 
among the swine measurements of table 13.1 is approximated from 
table 13.4: 


— 9.1860 

\/ (2,744.78) (0.8047) 


-0.196 


You see immediately that r Y 1.2 gives information like that got from 
b Y i. 2 . In fact, one definition of partial correlation is a generalization 



358 


Statistical Methods 


Section 13.8 


of that in section 7.4. If one first estimates A\ from X% independent of 
Xz ( 612 . 3)5 then A r 2 from Xi ( 621 - 3)5 he may define the partial correlation as 

ri2-3 = V 6 12- 3 621.35 

the geometric mean of the two partial regressions. This emphasizes the 
fact that partial correlation is peculiarly appropriate for the study of 
relations among sets of measurements made simultaneously where there 
is no more reason to estimate Xi from A " 2 than to estimate X 2 from AT 
If X\ follows after X% and A T 3 , or is caused by them, the partial regressions, 
612.3 and 613 . 2 , usually give the information desired. 

A consequence is that one commonly wishes to know about partial 
correlations in a group of total correlations where regression is out of the 
picture. The convenient formula is then 

^12 — fi3 r 23 

ri2-3 — — : — iriTrr 

v (l — ri 3 2 )(l - r23 2 ) 

For an illustration we use some correlations among ear circumference e , 
cob circumference c , and number of rows of kernels k calculated from 
measurements of 900 ears of corn (2) : 

r M = 0.799, r ek = 0.570, r cJfe = 0.507 

One might inquire if r ec is conditioned by the number of kernel rows. 
Since 

0.799 - (0.570) (0.507) 

Tcc'k , - ~ - 0.720, 

\/(l - 0.325) (1 - 0.257) 

the conclusion would be that k has little effect on the correlation — it is 
much the same whether k varies or is constant. Another statement is this: 
the average correlation in groups of ears having the same number of 
kernel rows is little different from the total correlation. This statement 
is a generalization of the first. The information furnished by partial 
correlation is independent of values of the eliminated variable. This 
eliminated variable is not necessarily held constant. 

Another way to tackle the problem of the foregoing paragraph is to 
compute 

0.507 - (0.799)(0.570) 

r (k ., = - = 0.105 

ViT^ 0.638) (1 - 0.325) 

Thus, cob circumference and kernel rows are practically uncorrelatcd 
within groups of ears having the same circumference. 

For testing the significance of partial correlations use table 7.3 (or 
table 13.6, 2 variables), but enter it with degrees of freedom n — 3. In 
general, the number of degrees of freedom for a partial correlation is the 
number of sets of observations minus the number of variates involved. 
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EXAMPLE 13.11— 'For the pig data substitute the values of r from table 13.2 in 
the formula for rn -2 with the result — 0 . 190 . Compare this with the average correla- 
tion, — 0.196, within the groups of table 13.3. 

EXAMPLE 13.12 — Verify the fact that partial correlation is the geometric mean 
of two partial regressions by using results from section 13.3 and example 13.5. 

EXAMPLE 13.13 — In the same way verify the formula, 

ri 2 - 3 8=8 y/b' 

EXAMPLE 13.14 — The data of examples 13.1 and 12.1 lend themselves to the 
verification of the statements made about partial correlation and of the two formulas 
given for ru-*. 


13.9 — Four or more variates in a single group. Increasing beyond 
three the number of measurements on each individual though involving 
no new principles does introduce increasing complexity into the methods 
of multiple regression. Only the fundamentals will be presented here. 
Laboratory methods of calculation, together with more extensive discus- 
sions may be found in specialized publications (3) (4) (5) (10). 

Amateurs should be cautioned not to rush headlong into regression 
studies involving many variables. Some people seem to think there is 
magic in the collection of vast amounts of data — that by some alchemy 
multiple regression will yield authentic information from careless measure- 
ments on heterogeneous material. The fact is that hazards increase with 
the extent and complexity of the investigation. Danger of the loss of data 
or failure of techniques, difficulty of finding homogeneous material, per- 
plexities involved in interpretation, these are a few of the obstacles. How- 
ever, do not be deterred if you have well taken measurements on carefully 
chosen material, and if you have definite questions whose answers lie in 
the methods to be used. As compared to the labor of getting the data the 
calculation of regression statistics is easy. 

Should you have four or more variates in several subsamples of indi- 
viduals, the subsamplcs being representative of somewhat different but 
related populations, methods analogous to those of sections 13.6 and 13.7 
are available. 

One should be increasingly cautious of including more variables in 
small samples. The reason is readily explained by an appeal to the 
geometry of regression, but we shall have to start with two variables. 
In a plane, two pairs of observations fix a linear regression, all additional 
pairs being available not only for improving the regression but for an 
estimate of error as well. In three-space the required number of triplets 
of observations to fix a regression plane is three. The analogy may be 
extended to four and more variables despite the limitations of our three- 
dimensional world. A regression in six variables would fit perfectly 
(R = 1) six sets of observations, leaving no information about error. 
Don’t get excited, then, if you get R = 0.9 in a six- variable regression 
with only 10 sets of observations. As you will find later, that value is not 
even significant. One can easily be deceived by a large R if n is little 
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greater than the number of variates. With small samples be meticulous 
about all tests of significance, then be a bit skeptical besides. 

13.10 — Calculation of regression with four variates. For illustration 
we have chosen a sample of four measurements on each of 44 honey bees 
(8). The immediate object is to learn what dimensions, easily obtained, 
best indicate the length of proboscis. The latter is difficult to measure 
directly. The possibility of selecting bees for length of proboscis is of 
interest in Iowa because of the problem of pollination of red clover, 
the nectar in which cannot be reached by the ordinary honey bee. The 
variables are dry weight in mgs. (A r i), length (A^), and width (A'a) of 
right forewing in mm., and length of proboscis (F), also in mm. 

The sums, sums of squares and sums of products in table 13.13 were 
calculated from the experimental data in the manner now familiar. The 
correction terms and the remaining computations of the correlation coeffi- 
cients need only be mentioned. A word should be said about the number 
of significant figures. The correction terms are almost as large as the sums 
of squares of the observations, indicating small variation. In the remain- 

TABLE 13.13 

Computation of Correlation Coefficients Among Four Measurements 

on Honey Bees 


N - 44 

Dry 

Weight 

X\ 

Length 
of Wing 

Xt 

Width 
of Wing 

Length of 
Proboscis 

Y 

Sum 

576 4 

422 7 

144 5 

290 0 

Mean 

13 10 

9 61 

3 28 

6 59 

Xi 1 . SX 2 , etc. 

7567 5240 

5539.2979 

1893 7740 

3800 5057 

2. Correction 

7550 8400 

5537 3700 

1892 9500 

3799 0000 

3. Sx 3 , etc. 

16 6840 

1 9279 

0 8240 

1 5057 

4. x/Sx\ etc. 

4 0846 

4 0691 

1 9365 

3 3759 

5. r’s 


0 4738 

0 4255 

0 4460 

X 2 1. SX*, etc. 


4061.7944 

1388 5203 

2786 5762 

2. Correction 


4060 8020 

1388 1852 

2785 9773 

3. Sx 2 , etc. 


0 9924 

0 3351 

0 5989 

4. x/Sx 2 . etc. 


0 9962 

0 4723 

0 8234 

5. r’s 



0 7095 

0 7274 

Xi 1. SX 2 , etc. 



474 7759 

952 5712 

2. Correction 



474 5511 

952 3864 

3. Sx 2 , etc. 



0 2248 

0 1848 

4. \/Sx 2 , etc. 



0 4741 

0 3918 

5. r 




0 4716 

Y 1 . SY 2 




1912 0467 

2. Correction 




1911 3636 

3. Sf 




0 6831 

4. vV 




0 8265 

5. j. 




0 1260 
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ders of lines 3 the last figure and in some cases the last but one have little 
relation to the experimental measurements. They must be retained, how- 
ever, in this long series of operations in order that rounding may not 
impair the accuracy of the calculations. 

There are now three normal equations to be solved for the three 
standard partial regression coefficients required in the regression equation, 

# = b'yi.2zX f i + b' Y 2-13X f 2 + b f Yi. 12*Y 

in which the dashed *’s represent standard measure. The symbol b Y \-n is 
read, “the standard regression coefficient of Y on X\ independent of 
X?, and AY” The normal equations are: 



i 

! 

* : 

; 

^ ; 

Xi 

b'yuu 4" Tub'yi'iz 4~ *» r Y i 

Ah 

T\tb'yi.it 4“ b'yt.iz -f fu^YVlt = r Y t 


rub'yi.u 4- risb'yz.n 4~ b’rv it ** rn 


Notice the symmetry in these equations. The subscripts of the appro- 
priate r’s are easily determined from those of the A^s. 

Substituting the r’s from table 13.13: 


Ah 


Ah 


Ah 


0 

0 


Ah Ah Ah Y 

b'yi'ii + 0 4738 b ' FM8 + 0 4255 ^h'3-12 = 0 4460 (1) 

4738 b'rx.n + b' Y2 . n + 0 7095 b'ya.n « 0 7274 (2) 

4255 b'n-n + 0 7095 b'y^n *f A'ri . n - 0 4716 (3) 


This set of simultaneous linear equations is solved by the ordinary methods 
of elementary algebra. A good way to start is to divide each of the equa- 
tions 1 and 2 by its coefficient of b' YZ .n\ that is, by 0.4255 and 0.7095 
respectively. The results are 

2.3502 b’ Y i-u + M135 b' Y i.\z + b’ YZ . u = 1.0482 (4) 

0.6678 b' Yh 23 + 1.4094 b' Yt . « + b f Y3 . l2 = 1.0252 (5) 

With a calculating machine the divisions are made most easily by use of 
the reciprocal. For example, 1/0.4255 = 2.35018, the leading coefficient 
of equation 4. The products of this reciprocal by 0.4738, 0.4255, and 
0.4460 are the remaining coefficients of equation 4. 

Now subtract equation 3 from 4, then 5 from 4: 

1.9247 V Y i. 2 3 + 0.4040 V Y 2 . 13 = 0.5766 (6) 

1.6824 b' Y 1.23 - 0.2959 b' Y v n = 0.0230 (7) 
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Multiply each of these equations by the reciprocal of its coefficient 
of b'yi-n' 

4.7641 b'n.u + i'n.u = 1.4272 (8) 

- 5.6857 b' Y i.u + b' Y 2 .i8 = —0.0777 (9) 

Subtracting 9 from 8, 

10.4498 b' Y i.23 = 1.5049, (10) 

whence b f Y 123 == 0.1440 (11) 

This value is to be substituted back in equation 8 (or 9 or both) yielding, 

(4.7641)(0.1440) + V Y 2.13 = 1.4272, 
from which 6V2.13 = 1.4272 - 0.6861 = 0.7411 

Substituting both these 6-primes in one of the original equations, 3 for 
example, 

(0.4255) (0.1440) + (0.7095)(0.7412) + 6V3.12 = 0.4716 
*V«. 12 = 0.4716 - 0.0613 - 0.5259 - -0.1156 

For verification, the three 6-primes may be substituted in each of the 
remaining equations, 1 and 2. Occasionally a small r is multiplied by a 
small 6' causing the product to have no figure in the first four decimal 
places. Then the equation may seem to verify the 6-primes despite the 
fact that the small one may be in error. For that reason, it may be as well 
to verify them in both the original equations. 

For convenience and later comparisons we bring together the following 
statistics: 


Proboscis Length 

Dry Weight 

Wing Length 

Wing Width 

Y 

X, 

x% 

X, 

Correlations of Y with X’s 

0 4460 

0.7274 

0 4716 

Standard regressions of Y on X’s 

0.1440 

0 7411 

-0 1155 


Although not needed for this example, the regression equation is calcu- 
lated following the method of section 13.3: 

f = 6.59 + 0.1440j^~ {X\ - 13.10) 

4.0846 

0.8265 


+ 0.7411 


0.1155 


0.9962 

0.8265 

0.4741 


(X t - 9.61) 
(X t - 3.28) 


0.0291.3fi + 0.6148Zi - 0.2014X, + 0.96 
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If the 44 estimated values, f', were computed then correlated with actual 
values Y , the result would be (section 13.4), 

R 2 = (0.4460)(0.1440) + (0.7274) (0.741 1 ) 

+ (0.47 1 6) ( — 0.11 55) = 0.5488 

R « 0.7408 

Since this value is little larger than r Y 2 — 0.7274, small advantage has 
been gained by the multiple regression so far as estimating proboscis 
length is concerned. One might almost as well use the simple regression 
of proboscis length on wing length. 

Another indication of the outstanding role of wing length in estimating 
proboscis length is the predominating size of b' Y 2 . 1 s = 0.7411. In the 
presence of wing length the other two variates give little information. It 
is notable that the total correlations do not even denote the order of im- 
portance of the three independent variables. This is a warning not to 
try to interpret zero order correlations except in the light of the pertinent 
information contained in the 6 -primes. 

Another remark may be interpolated just here. It is usually futile to 
argue about cause-effect relations from information furnished by a single 
small correlation even though it be highly significant. Why? If this small 
correlation were included in a group having a large /?, the 6 -prime corre- 
sponding to it might be found to contradict its indication, as in the example 
being discussed. If you wish reliable information about relations, get 
together a group of variables whose connections have theoretical validity 
then, if R is large enough to indicate fairly complete determination, ex- 
amine the 6 -primes. 

As in the first example of this chapter, it is interesting to note the 
change of sign from r YZ = 0.4716 to b' Y 3.12 — —0.1156. If data were 
sufficiently plentiful it would be possible to gather into subsamples those 
quadruplets of measurements in which dry weight X\ and wing length Xi 
had some constant pair of values. It would be found that the average 
regression of proboscis length on wing width in such subsamples would 
be —0.1156. This regression would be independent of X\ and Xu because 
these two variates would be unchanging within each subsample. An 
interpretation may be ventured. While both wing width and wing length 
are affected alike by body size (r 2 3 = 0,7095), and while length of wing 
and proboscis are closely related (r r2 — 0.7274, 6 ' r 2 -ia = 0.7411), yet 
in groups of uniform body size and wing length the stocky bee tends to 
have a wide wing and short proboscis. 

The significance of R is tested by referring to table 13.6, looking down 
the column for four variables until you get to 40 degrees of freedom (44 
observations minus 4 variables), then comparing R = 0.7408 with the 5% 
and 1% values, 0.419 and 0.494. We conclude that there would be little 
chance of drawing our sample R from a population of uncorrelated variates. 
The significance of the regression coefficients will be discussed in section 
13.12. 
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An alternative method for testing the significance of R is more enlighten- 
ing though slightly less convenient. Analyze the variance of Y (Sj? 
= 0.6831 in table 13.13) in this manner: 



Degrees 

Sum of Squares 



of 



Mean 

Source of Variation 

Freedom 

Symbolical 

Numerical 

Square 

Linear regression 

3 

R*Sf 

0.3749 

0 125 

Deviations from regression 

40 

(1 - R*)& 

0.3082 

0.0077 

Total 

43 

sf 

0.6831 



F « 0.125/0.0077 = 16.2, dj \ - 3 and 40 


The conclusions, of course, arc the same as before. 

13.11 — Direct method of computation: 3 variates. Although corre- 
lation coefficients have been used extensively for regression calculations, 
they are relatively new in the history of statistics, having been introduced 
near the beginning of the present century. An older method involved the 
sums of squares and products, the normal equations for 3 variates being, 

Sx\b Y \*2 + Sxixzhyt.i = Sxy 
SxiXlbyl .2 + Sx<?by 2 . 1 = Sx 2 y 

These are solved directly for the b’s in the same way the correlation equa- 
tions were solved for the ^-primes: 

- ~ (S x i x i)(Sx*y) 

}1 2 (Sxi 2 )(Sx<2 2 ) — (5a' 1 X 2 ) 2 

__ (Sxi 2 )(Sx 2 y) — (SxiXt)(Sxiy) 

121 (Sxi 2 )(Sx 2 2 ) - (Sxix 2 ) 2 

If you substitute the sums of squares and products for the pig data, you 
will get b r i '2 = —0.003290 and by 2 .\ = 0.007409 as before. The regres- 
sion equation becomes, 

f = 1.3882 - 0.003290(A', - 77.050) + 0.007490(AT 2 - 52.725), 

which reduces to that calculated earlier. 

The sum of squares due to regression is 

Sf = byi.tSxy + b Y t.\Sxiy 

= ( — 0.003290) (6. 9235) + (0.007409)(27.5408) 

= 0.1813, 

so that the sum of squares for deviations is 

Sdy.n* = Sf - Sf 

= 1.0228 - 0.1813 = 0.8415 
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EXAMPLE 13.15 — The following data are adapted from a report by Pearl (9). 
The problem was to discover what factors might have been related to the intensity of 
the influenza epidemic of 1918. The onslaught was known to have varied widely in 
different cities: could this have been attributed to measurable characteristics of the cities 
themselves? 


City 

Age 

Distribution 
of Population 

Latitude 

(Degrees) 

Deaths per 
100,000 From 
Heart Diseases 

25-week Excess 
Mortality 
(per 1,000) 

i Xi 

*2 

A's 

Y 

1. Albany 

11 

43 

236 

47 

2. Atlanta 

83 

34 

122 

27 

3. Baltimore 

32 

39 

196 

61 

4. Boston 

31 

42 

212 

65 

5. Buffalo 

49 

43 

166 

58 

6. Cambridge 

30 

42 

184 

59 

7. Chicago 

69 

42 

160 

38 

8. Cincinnati 

23 

38 

202 

40 

9. Cleveland 

75 

42 

118 

40 

10. Columbus 

29 

40 

156 

32 

11. Dayton 

24 

40 

184 

35 

12. Fall River 

73 

42 

164 

58 

13. Grand Rapids 

28 

43 

146 

15 

14. Indianapolis 

24 

40 

186 

25 

15. Louisville 

32 

38 

164 

36 

16. Los Angeles 

14 

34 

156 

52 

17. Lowell 

35 

43 

158 

51 

18. Milwaukee 

62 

43 

100 

29 

19. Minneapolis 

55 

45 

114 

27 

20. Nashville 

48 

36 

198 

78 

21. Newark 

63 

41 

148 

51 

22. New Haven 

34 

41 

186 

56 

23. New Orleans 

52 

30 

224 

72 

24. New York 

74 

41 

166 

47 

25. Oakland 

17 

38 

188 

59 

26. Philadelphia 

34 

40 

204 

73 

27. Pittsburgh 

72 

40 

134 

80 

28. Providence 

30 

42 

152 

53 

29. Rochester 

27 

43 

204 

27 

30. St. Louis 

45 

39 

142 

30 

31. St. Paul 

69 

45 

118 

33 

32. San Francisco 

34 

38 

238 

75 

33. Toledo 

33 

42 

186 

21 

34. Washington 

21 

39 

236 

66 

Sum 

1,432 

1,368 

5,848 

1,616 


The correlations are as follows: 



x* 

AT, 

Y 


0.07798 

-0 61118 

-0 02581 



— 0.30774 

-0 34819 

X 9 



0 48446 


You will get R 1 » 0.3773 and the regression equation, 

f = 0.35313X1 - 0.95405X: + 0.33149X, + 14.026 



366 


Statistical Methods 


Section 13.11 


Finally, R 2 = Sf/Sy 2 

= 0.1813/1.0228 = 0.1773, 

as in section 13.4, except for negligible rounding errors. 

For testing the regression coefficients and for making fiducial state- 
ments there are needed 3 quantities called multipliers. These were intro- 
duced by Gauss (7): they are often referred to as “elements of the inverse 
matrix.” They all have the same denominator, 

D = (&!*)(& 2 2 ) - (Sx lX2 y 

= (4,735.90)(5, 065.98) - (3,037.55) 2 
= 14,765,300 

This same denominator appeared in the formulas for the b y s; so its re- 
ciprocal, 0. 677265/1 0 7 , may be set in the machine for multiplying the 
numerators of all five fractions. The multipliers are, 

cix = Sx 2 2 /D = (5,065.98) (0.677265)/10 7 = 0.00034310 

C22 = Sx?/D = (4,735.90) (0.677265)/! 0 7 = 0.00032075 

cn = -Sx lX2 /D = — (3,037.55)(0.677265)/10 7 - -0.00020572 

We now calculate the variance of deviations from regression, 

Sd Y . n 2 /(n - 3) = 0.8415/37 = 0.02274 

from which, s Y . 12 — 0.1508. From the above come the following standard 
deviations and values of t : 

For by !. 2 : j * « Sy.nVcu = 0.002793, t = 0.003290/0.002793 = 1.18 
For by 2 . x : * = ^. 12^22 = 0.002701, / = 0.007409/0.002701 = 2.74 

The values of t are identical with those for the corresponding /;-primes: 
it is a matter of preference which method is used for making the test. 

Since we now have the standard deviation of b Y 1 . 2 , we may set 95% 
fiducial limits, 

— 0.003290 ± (2.026) (0.002793) ; 

that is, from —0.008949 to 0.002369. This shows the unreliability of our 
estimate of this partial regression — it might be positive in the population 
instead of negative as in the sample. Of the other coefficient the limits are 
from 0.001937 to 0.012881 pound per day per pound initial weight. 

For the fiducial limits of any F, the variance is computed by, 

*r-12 2 (l /n + Cn*i 2 + C 2 2* 2 2 + 2ri2*i* 2 ) 

where x% and x 2 are deviations from their respective means. As an ex- 
ample, let us calculate the limits for the rate of gain estimated for pigs 
weighing 60 pounds at 60 days of age. Substituting in the regression 
equation, the predicted rate is 

P = 1.3882 - 0.003290(60 - 77.050) + 0.007409(60 - 52.725) 

= 1.50 pound per day, 
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with the variance, 

0. 02274[l/40 + (0.0003431 0) ( — 1 7.050) 2 + (0.00032075) (7,275) 2 

+ 2( — 0.00020572) ( — 17.050) (7.275)] = 0.004384 

Thus, the standard error is 0.0662. Finally, the 95% limits are 1.50 & 
(2.026) (0.0662); that is, from 1.37 to 1.63 pounds per day. 

The variance of the estimated rate of an individual pig could be found, 
as in section 6.9, by adding s y . 12 2 = 0.02274 to 0.004384. 

13.12 — The Gauss multipliers with more than 3 variates. As the 

number of variates increases the computations grow heavier, but no new 
features are added. For 4 variates, the normal equations in the direct 
method are, 

Sx^byi* 23 + Sxix 2 b Y 2'U + Sx\X 3 b Y z-n — Sx\y 
Sx\X2b Y i-2z + Sx 2 2 b Y2 . u + Sx'iXzb Y z.ii = Sx 2 y 
SxiXzb Y \.n + Sx 2 xzb Y 2.n + Sxz 2 b Y3 .u = Sx^y 

After substitution of sample data for the sums of squares and products, the 
three 6 s s may be solved for by the usual methods; the computations of the 
regression equation, S$ 2 , Sd Y .ns 2 , and JR 2 parallel those for 3 variates. 
Instead of giving these details, we shall show how to compute the Gauss 
multipliers and to use them for completing the regression calculations. 
This may be done in either the correlation or the direct method: we shall 
do the latter, then show later some of the relations. 

The equations giving the multipliers are got from the normal equations 
by replacing the b' s by c' s and the right members successively by 1, 0, 0; 0, 
1,0; and 0, 0, 1; then solving each set for the c’s. For example, in the 
first set, using the bee data in table 13.13: 

16.6840 c n + 1-9279 c n + 0.8240 c 9X = 1 

1.9279 c u + 0.9924 c t i + 0.3351 c 9X = 0 

0.8240 cn + 0.3351 c %l + 0.2248 c n = 0 

Solving as before, c\\ — 0.07892, c 2 \ — —0.11202, and c s i ~ —0.12230. 

Solving the second set, with 0, 1, 0 on the right, C\ 2 = —0.11202 
— c 2 u c 22 — 2.18790, cz 2 = —2.85079. The third set with 0, 0, 1 on the 
right, gives c u — —0.12230 = r 3 i, cn = —2.85079 = r 3 2 , and c 3 s 
= 9.14624. These elements are entered in table 13.14 along with the 
sums of products of Y by the A^s. At first, the computation may seem 
forbidding, but actually it is not difficult. Some short-cuts will be shown 
below after the utility of the multipliers has been presented. The following 
statistics may now be calculated: 

1. The regression coefficients. One of them is taken from each column of 
multipliers in the table. As an example, from solution 2, 

b Y 2.18 == c i 2 Sx\y + £22*$* f 32^*3 V 

= ( — 0.11 202) (1 .5057) + (2.18790) (0.5989) 

+ ( — 2. 85079) (0.1 848) = 0.61484 
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TABLE 13.14 

Three Sets of Multipliers Together With Sums of y - Products and Resulting 
Regression Coefficients. Bee Data 


Multiplier 

Solution Number 

Sums of 
^-products 

1 2 3 

Cl 

cu « 0 07892 cu » —0.11202 c n * -0.12230 

1.5057 

Ci 

c 21 « -0.11202 c 22 - 2 18790 r 28 - -2 85079 

0.5989 

Cl 

di - -0 12230 c%% * -2.85079 c 83 - 9.14624 

0.1848 


Regression 

Coefficients 0 02914 0.61484 -0.20126 


The regression equation is then computed as usual. 

2. The sum of squares attributable to regression. 

Si) 2 = byi-zsSxiv + by2.nSx2y + by^nSx^y 

= (0.02914)0.5057) + (0.61 484) (0.5989) + ( — 0.201 26) (0.1 848) 
= 0.3749 

3. The sum of the squares of deviations from regression. 

Sd Y . 123 2 = Sf - Sy 2 = 0.6831 - 0.3749 = 0.3082 

4. The variance and standard error of estimate. 

sy.uz 2 = Sdy.m/d.f. = 0.3082/37 = 0.007705. j r . 123 = 0.08778 

5. The multiple correlation coefficient. 

R 2 = Stf/Sf = 0.3749/0.6831 = 0.5488. R = 0.7408 

6. The standard deviations of the regression coefficients. As an example, 
for by 2 <iz' 

S = s Y . i2s\/r22 = (0.08778) (1 .4792) = 0.01298 

The testing of significance and setting of fiducial limits now proceeds as 
usual. 

7. The standard deviation of the difference between two regression coefficients. 
This is seldom wanted, but if there were interest in testing the hypothesis 
that the regression on wing length and on wing width are the same, the 
variance of the difference is 

+ £22 2 ci 2 ) 

= 0.007705(0.07892 + 2.18790 - 2[— 0.11202]) 

= 0.019192 

Hence, the standard error is 0.1385, so that the difference, 0.61484 — 

( — 0.20126) = 0.81610, is tested by / = 0.81610/0.1385 = 5.89, <//. = 37. 
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J 8. The standard error of T. The variance of this estimate is 

i y 

Sy 128 2 (l/tt + Cl%Xi* + C% 2 X 2 * + + 2CuX&2 + 2c\zXiXz + 2cnX%Xz) } 

♦ 

where the *’s are deviations from their respective means. The standard 
error may be used for calculating fiducial limits of F and (after the usual 
modification) of F. 

EXAMPLE 13.16 — In the direct method, it is advantageous to code the data so 
as to make the ranges of the independent variates as nearly alike as feasible. For 
the influenza data of example 13.15, this may be done as follows: (i) subtract 30° 
from each X% , then multiply by 6; (ii) divide each Xi by 2. Since this involves no round- 
ing, the final results will be unaltered. The values of the variates for the first two cities 
arc: 

Albany: 11,78,118,47 
Atlanta: 83, 24, 61, 27 

Complete the table. 


EXAMPLE 13.17 — Using the coded data of the preceding example, compute the 
sums of squares and products: 



X\ 


■Xi 

Y 

Xi 

13,657 53 

1,008.35 

-7,501.00 

- 310.12 

Xi 


12,244.24 

-3,576 00 

-3,961.41 

Xi 



11,028.00 

5,231.00 

Y 1 




10,572.47 


EXAMPLE 13.18 — From the foregoing sums of squares and products, calculate 
the regression coefficients, 

0.35316, -0.15808, 0.66300 

To compare these with the coefficients in example 13.15, you will have to reverse the 
coding process, dividing —0.95405 by 6 and multiplying 0.33149 by 2. The discrepan- 
cies in the fifth place of decimals are the accumulated effects of rounding in the two 
solutions of the normal equations. 

13.13 — Deletion of a variate. After a regression is completed, the 
value of a variate may be questioned and its omission proposed. The 
computations with the reduced number of variates may, of course, be 
carried out anew, but desired information is often to be had more easily 
by means of the formulas below (6). 

For the bees, the regression on width of wing is small and nonsignificant. 
If this variate, Xz, is deleted, the sum of squares due to regression is de- 
creased by 

iWAsa = (~0.20126) 2 /9. 14624 = 0.0044, 
leaving for the reduced regression, 

W « 0.3749 - 0.0044 = 0.3705, 

the subscripts indicating that # is now estimated from X\ and X 2 only. 
The corresponding multiple correlation coefficient is 

, Ryv? « S$ n ySf * 0.3705/0.6831 - 0.5424, 

Ry 12 = 0.7365, d.f. - 38 



TABLE 13.15 

Computation of the Multipliers for the Bee Data of Table 13.13. Entries Are Rounded to 4 Decimals, But the 

Original Calculation Required 6 
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If the reduced regression is decided upon, the two partial regression 
coefficients are calculated easily: 

by 1*2 *= byi.zz — £l3(^Z3*12A8s) = 0.02645 
by %- 1 = byz-iz — ^ 23 (^ 3 * 12 /^ 33 ) == 0.55212 

Or, one may prefer to calculate the multipliers in the reduced regres- 
sion. These are, 

fcn ) r = £n ~ Ci^/czt = 0.07728 
(^ 22 )r = £22 — Cz^jczz = 1.29934 
{?i2)r — £12 — c\zP2z/czz ~ —0.15014 

From these, all the statistics in the 3- variable regression may be calcu- 
lated, including those given above. 

Formulas for deleting a different variate may be inferred from those 
given: only keep in mind the number of the variate being eliminated and 
that of the one for which the statistic is being computed. Analogous 
formulas are easily written for deletion in a regression having more than 
4 variates. 

13.14 — Calculation of the multipliers. Dozens of methods are avail- 
able, each offering some advantages. I shall do no more than indicate how 
the elementary solution of simultaneous equations may be systematized. 
For the occasional user, with a moderate number of variates, no time is 
saved by the specialized methods because the procedures must be re- 
memorized each time. Those who do many regressions will no doubt 
adopt one of the more abbreviated schemes (3) (4) (5) (10) already 
mentioned. 

In table 13.15, the solutions of the three sets of equations are carried on 
simultaneously with this advantage: each divisor is applied to an entire 
line in the table by setting its reciprocal in the machine and multiplying 
successively by all the necessary coefficients. Thus, in line 4, the reciprocal 
of the divisor, 0.8240, is entered at the proper place in the table (1/0.8240 
= 1.2136), and is then put on the keyboard for multiplication by 16.6840 
and 1.9279. 

Those familiar with simultaneous equations will have no difficulty in 
following the directions. If you wish to verify the fourth place of decimals 
you will have to carry all calculations to six places. This is necessary in 
order to make cn = cai, etc. Otherwise, rounding will prevent exact 
verification. 

After one becomes familiar with the process he will agree, I think, that 
the computation is not excessive if judged by the amount of information 
obtained. 

EXAMPLE 13.19 — Using the sum of squares and products in example 13.17, calcu- 
late the multipliers, c: 

0.00011944 0.00001534 0 00008621 

0.00009219 0 00004033 

0.00016240 
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13,15— Relations between the correlation and direct methods. 

Each method leads to a complete set of regression constants: the user, 
then has entire freedom of choice. 

Attention has been called to the fact that either set of partial regression 
coefficients can be calculated from the other. For example, 

, Vs/ 

byi-U — b n.23^— 2 

The quantities, R and are derived as parts of either procedure. 

The multipliers, c f , in the correlation method may be computed from 
the normal equations in section 13.10 by changing the 6 -primes to oprimes 
and replacing the right members by 1 , 0 , 0 , etc. The following relations 
will enable one to transfer from one set to another: 


c n = c'u/Sxi 2 , etc. 
c 12 = c f nrn/Sxix^ etc. 

The r-primes may be entered in a table like 13.14, with the correla- 
tions, r„i, r v 2 , etc., in the column at the right; then the columns may be 
combined as before, the sums of products being the 6 -primes. 

For testing significance of the 6 -primes, the quantity, 


1 - R « _ 1 - 0.5488 
n — m 44 — 4 


0.01128, 


replaces s Y - in 2 so that the variance of 6V1.23 is vc'u. The ensuing values 
of t are identical; that is 

6 Vl»23 __ by 1-23 

Vvc\l Sy. 128 \/cil 

With one exception, the formulas for deleting a variate apply to 
primed 6 ’s and c* s as well as to the un-primed. The exception: if X d is 
deleted, R 2 (instead of R 2 Sy 2 ) is decreased by 6 V 8 -i 2 2 A' 88 . 

Mastery of the computational methods enables the investigator to get 
desired information without excess calculation. If it is the regression 
equation with an overall test of significance that is needed, the direct 
solution of the normal equations is indicated. A detailed study of correla- 
tion and regression coefficients calls for the correlation method, either 
with or without the multipliers. On the other hand, the direct method 
with multipliers leads to tests of significance and fiducial statements 
involving T, Y, and the 6 ’s. For maximum flexibility in examining the 
regression, start by calculating the multipliers. 

EXAMPLE 13.20 — From the correlations in example 13.15, compute the multi- 
pliers, c’\ 

1.63117 0.19850 1 05803 

1 12881 0 46876 

1.79093 
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EXAMPLE 13.21 — If you solve the normal equations in section 13.3, substituting 
c r for b ' together with 1,0 and 0,1 in the right members, you will get the multipliers, 


c 11 


1 - r l2 * 


1 c'n 


C 18 


fis 

1 -ria* 


The formulas in section 13.5 may be derived from these multipliers. 


EXAMPLE 13.22 — Delete X\ from the influenza regression. All the results may be 
verified by calculating the statistics of the reduced regression, using the formulas 
which have been given for the multipliers (section 13.11 and example 13.21). 
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Chapter 14 

CURVILINEAR REGRESSION 


14.1 — Introduction. Although linear regression is adequate for many 
needs, it is a matter of common observation that some variates are not 
connected by so simple a relation. The discovery of a precise description 
of the concomitant variation of two or more quantities is one of the prob- 
lems of curve fitting , known as curvilinear regression. From this general view 
the fitting of the straight line is a special case, the simplest and indeed the 
most useful. 

The motives for fitting curves to nonlinear data are various. Some- 
times there is the desire to make a good estimate of the dependent variate 
for any particular value of the independent. This may involve the smooth- 
ing of irregular data and the interpolation of estimated Y's for values of X 
not contained in the observed series. Sometimes the objective is to test or 
discover a law relating the variates, such as a growth curve. At yet other 
times, the form of the relationship is of little interest; the end in view is 
merely the elimination of inaccuracies which nonlinearity of regression 
may introduce into a correlation coefficient or an experimental error. 

The problem of fitting curves to data has been approached from many 
directions. Except when a law is being tested, the chief difficulty is the 
selection of a suitable equation. Graphical devices of rectification (1) and 
more theoretical methods (3) (7) (14) are useful and readily available to 
one who has some acquaintance with mathematics. In the main, we shall 
confine our attention to the exponential growth curve and to the poly- 
nomial. 

14.2 — The exponential growth curve. A characteristic of some of 
the simpler growth phenomena is that the increase at any moment is 
proportional to the size already attained. This is sometimes called the 
law of compound interest. During one phase in the growth of a culture 
of bacteria, the numbers of organisms follow such a law. The relation is 
nicely illustrated by the dry weights of chick embryos at ages 6 to 16 days 
(12) recorded in table 14.1. The graph of the weights in figure 14.1 
ascends with greater rapidity as age increases, the regression equation 
being of the form commonly associated with such growth, 

w=(A)m, 

where A and B are constants to be evaluated. Applying logarithms to the 
equation, 

log W = log A + (log B)X 
or Y = a + bX, 

[ 374 ] 
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TABLE 14.1 

Dry Weights of Chick Embryos From Ages 6 to 16 Days, 
Together With Common Logarithms 


Ages in Days 

X 

Dry Weight, W 
(grams) 

Common Logarithm 
of Weight 

F 

6 

0 029 

-1.538* 

7 

0.052 

-1.284 

8 

0.079 

-1.102 

9 

0.125 

-0.903 

10 

0.181 

-0.742 

11 

0.261 

-0.583 

12 

0.425 

-0.372 

13 

0 738 

-0.132 

14 

1.130 

0.053 

15 

1.882 

0.275 

16 

2 812 

0 449 


* From the table of logarithms, one reads log 0.029 « 8.462 — 10 or 2.462, which 
means —2 0.462 = —1.538. 


where Y = log W, a = log A , and b = log B. This means that if log W 
instead of W is plotted against X, the graph will be linear. By the device 
of using the logarithm instead of the quantity itself, the data are said to 
be rectified . 

The values of F = log W are set out in the last column of the table 
and are plotted opposite X in the figure. The regression equation, com- 
puted in the familiar manner from the columns X and Y in the table, is 

F* = 0.1959A' - 2.689 

The regression line fits the data points with unusual fidelity, the correla- 
tion between F and X being 0.9992. The conclusion is that the chick 
embryos, as measured by dry weight, are growing in accord with the 
exponential law, the logarithm of the dry weight increasing at the esti- 
mated uniform rate of 0.1959 per day. 

Often there is little interest in the regression equation itself, the objec- 
tive being merely to learn if the data follow the exponential law. The 
graph, then, may furnish sufficient information. If soothe use of semi- 
logarithmic or ratio graph paper obviates the necessity for looking up the 
logarithms of F. The horizontal rulings on the paper are drawn to such a 
scale that the plotting of the original data results in a straight line if the 
data follow the exponential growth law. Ratio paper may be purchased 
at most stationery shops. 

It is sometimes considered desirable to write the exponential growth 
law in the form, 

W = Ae bX , 

* The distinction between F and estimated Y is usually obvious. ? will be used 
only when the error of estimate, F — ?, is to be emphasized. 
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Fio. 14.1— -Dry weights of chick embryos at ages 6-16 days with fitted curves. 
Uniform scale: W = 0.002046(1.57)* 

Logarithmic scale: Y ~ 0.1959.Y — 2.689 


where e is the base of the natural or Naperian system of logarithms, an 
irrational number whose approximate value is 2.718. If this form is 
decided upon, the convenient way is to use natural logarithms; then, 

log, I W = log c ^4 + bX 

It may be shown that the regression coefficient, b , in this equation is the 
relative rate of increase of W; that is, the rate of increase per unit X per 
unit W. This relative rate is 2.3026 (= log,10) times the rate indicated 
by the slope of the regression line in common logarithms. Thus, for the 
chick embryos, the relative rate is (2.3026)(0.1959) = 0.4511 gram per 
day per gram. It is clear that the information about relative rate of 
growth can be deduced from either form of the equation. One may just 
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as well use common logarithms if he finds them more convenient, then get 
the correct relative rate by multiplication above. There are no other 
advantages in the natural logarithms unless tables of natural logarithms 
are more readily available than those of common logarithms. 

If there is a necessity for converting the equation, log W = 0.1959X 
— 2.689, into the exponential form,* 

W** (0 .002046) (1.57)*, 
or the formf W = (0. 002046) *r°’ 481lX , 

it must be remembered that the sum of squares of the errors of estimate is 
not a minimum for the exponential. It is the sum of the squares of 
log W — estimated log W which was minimized. If it is important to 
reduce to a minimum the sum of squares of deviations from the expo- 
nential, methods are available (14, VI, and 3, as examples). Ordinarily 
there is no object in reconverting the equation to the exponential form, 
or in making the slight improvement in the constants even if it is so con- 
verted. 

Workers in other lines may wish to test different theoretical regressions. 
Occasionally the simple inverse proportion is indicated, 

Y = 1/X 

Other possibilities are Y = log X, Y = \JX and log Y — a + b log X. 
The testing of the applicability of the proposed law should first be done 
graphically. Should the data appear to be rectified by the transformation 
of one or both the variates, then proceed with the regression computations 
if desired. For testing the last of the above equations, logarithmic paper 
is available, both vertical and horizontal rulings being in the logarithmic 
scale. For directions to the use of graph paper with special scales, see 
reference (8). 

EXAMPLE 14.1 — J. W. Gowen and W. C. Price counted the number of lesions 
of Aucuba mosaic virus developing after exposure to X-rays for various times. The 
following record is from as yet unpublished data made available through courtesy of the 
investigators. 


Minutes exposure 

0 

3 

7 5 

15 

30 , 

45 

60 

Count in hundreds 

271 

226 

209 

108 

59 

29 

12 


Plot the count as ordinate, then plot its logarithm. Derive the regression, Y — 2.432 
— 0.02227A', where Y is the logarithm of the count and X is minutes exposure. 


* To get this form, note that 0.1959 =* log 1.57 and 2.689 » log 488.65, hence 
log W « (log 1.57)’(Z) - log 488.65 » log 1.57* - log 488.65 

- log ^5 - log (0.002046X1.57*). 


t This comes from the first form because log,1.57 ® 0.4511, whence 1.57"e<° ,4$u . 
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EXAMPLE 14.2 — Repeat the fitting of the last example using natural logarithms. 
Verify the fact that the rate of decrease of hundreds of lesions per minute per hundred 
is (2.3026) (0.02227) » 0.05128. 

EXAMPLE 14.3 — If the meaning of relative rate isn’t quite clear, try this approxi- 
mate method of computing it. The increase in weight of the chick embryo during the 
thirteenth day is 1.130 — 0.738 * 0.392 gram; that is, the average rate during this 
period is 0.392 gram per day. But the average weight during the same period is 
(1.130 -f 0.738)/2 «■ 0.934 gram. The relative rate, or rate of increase of each gram, 
is therefore 0.392/0.934 ** 0.42 gram per day per gram. This differs from the average 
obtained in the whole period from 6 to 16 days, 0.4511, partly because the average 
weight as well as the increase in weight in the thirteenth day suffered some sampling 
variation, and partly because the correct relative rate is based on weight and increase 
in weight at any instant of time, not on day averages. 


EXAMPLE 14.4 — Geddes (6) gives single figure estimates of the baking auality 
of straight grade flour (0) after being heated at 170° F. for various numbers of hours 
(T). Pairs of values (T,Q) are as follows: 0 25, 93; 0.50, 71; 0.75, 63; 1.0, 54; 1.5, 43; 
2.0, 38; 3.0, 29; 4.0, 26; 6.0, 22; 8.0, 20. Plot on the vertical axis against T on the 
horizontal. Setting log T = X and log Q ■= F, plot Y against X. Derive the regression, 
Y - 1.7116 - 0.4678AT. 


EXAMPLE 14.5 — Placing Y 1/Q. in the last example, plot Y against T. Since 

1 


the graph is not linear, try Y 


a - 14 


. Compute the regression of the rectified data, 


Y - 0.01 996 T + 0.00546, from which Q - 0.01996 7' V 0.0054 6 + U ‘ Runnin « (H 

VI) gives a graphical method for estimating the number that must be subtracted from 
W in order to get a straight regression. Can you improve on 1 4? How many degrees 
of freedom remain for testing with the mean square of errors of estimate? 


EXAMPLE 1 4.6 — Decker and Andre (2) investigated the mortality of chinch bugs 
exposed at a temperature of —12.2° C. for various periods: 


Hours exposure 

0.25 

0.5 

1 

4 12 24 48 72 

Percentage mortality 

59 

63 

65 

68 70 73 74 75 


Let Y ** percentage mortality and X =® logarithm of hours exposure. Fit the equation, 
Y « 6.051 X + 64.1. To plot these data on logarithmic paper, put the logarithmic 
scale horizontal and plot hours on it. 


EXAMPLE 14.7 — In an experiment on wheat in Australia, fertilizers were applied 
at various levels with these resulting yields: 


Level 

X 

0 

10 

20 

30 

40 

60 

Yield 

y 

26.2 

34.0 

36.3 

37.8 

38 6 

38.9 


Fit to these data the Mitscherlich equation (16), 

Y ** a — bX, 

where Y « log (39 — /). The yield, 39, is taken as the maximum yield irrespective 
of the amount of fertilizer. The fitted equation is Y = 0.01 434X — 0.2294. 
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14.3 — The second degree polynomial. Faced by nonlinear regression, 
one often has no knowledge of a theoretical equation to use. In many 
instances the second degree polynomial, 

Y - a + bX + cX 2 , 

will be found to fit the data satisfactorily. The graph is a parabola whose 
axis is vertical, but usually only small segments of such a parabola appear 
in the process of fitting. Instead of rectifying the data a third variate is 
added, the square of X. This introduces the methods of multiple regres- 
sion. The calculations proceed exactly as in chapter 13, X and X 1 being 
the two independent variates. It need only be remarked that y/X y log X , 
or \/X might have been added instead of X 2 if the data had required it. 

To illustrate the method and some of its applications, we present the 
data on wheat yield and protein content (10) in table 14.2 and figure 14.2. 
The investigator wished to estimate the protein content for various yields. 
We shall also test the significance of the departure from linearity. 



YIELD OF WHEAT IN BUSHELS PEB AQBE 

Fig. 14.2 — Regression of protein content on yield in wheat, 91 plots. 

Y - 17.703 - 0.3415A' + 0.004075A? 

The second column contains the squares of the yields in column 1. 
The squares are treated in all respects like a third variate in multiple 
regression. The computations described in chapter 13 yield the regres- 
sion statistics of table 14.3. The regression equation, calculated as usual, 

Y * 17.703 - 0M15X+ 0.004075X 2 , 

is plotted in the figure. At small values of yield the second degree term 
with its small coefficient is scarcely noticeable, the graph falling away 
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TABLE 14.2 

Percentage Protein Content (F) and Yield {X) of Wheat 
From 91 Plots * 


Yield, 

Bushel 

Per Acre 

X 

Square 

X} 

Percentage 

Protein 

F 

Yield, 

Bushel 

Per Acre 

X 

Square 

X* 

Percentage 

Protein 

Y 

43 

1,849 

10.7 

19 

361 

13 9 

42 

1,764 

10 8 

19 

361 

13.2 

39 

1,521 

10.8 

19 

361 

13 8 

39 

1,521 

10.2 

18 

324 

10 6 

38 

1,444 

10 3 

18 

324 

13 0 

38 

1,444 

9.8 

18 

324 

13 4 

37 

1,369 

10.1 

18 

324 

13 7 

37 

1,369 

10.4 

18 

324 

13.0 

36 

1,296 

10 3 

17 

289 

13 4 

36 

1,296 

11.0 

17 

289 

13 5 

36 

1,296 

12.2 

17 

289 

10 8 

35 

1,225 

10.9 

17 

289 

12 5 

35 

1,225 

12.1 

17 

289 

12 7 

34 

1,156 

10 4 

17 

289 

13 0 

34 

1,156 

10.8 

17 

289 

13 8 

34 

1,156 

10.9 

16 

256 

14 3 

34 

1,156 

12.6 

16 

256 

13 6 

33 

1,089 

10.2 

16 

256 

12 3 

32 

1,024 

11.8 

16 

256 

13 0 

32 

1,024 

10 3 

16 

256 

13 7 

32 

1,024 

10.4 

15 

225 

13 3 

31 

961 

12.3 

15 

225 

12.9 

31 

961 

9.6 

14 

196 

14 2 

31 

961 

11.9 

14 

196 

13 2 

31 

961 

11 4 

12 

144 

15.5 

30 

900 

9.8 

12 

144 

13 1 

30 

900 

10.7 

12 

144 

16 3 

29 

841 

10 3 

11 

121 

13 7 

28 

784 

9 8 

11 

121 

18 3 

27 

729 

13 1 

11 

121 

14.7 

26 

676 

11.0 

11 

121 

13 8 

26 

676 

11 0 

11 

121 

14 8 

25 

625 

12.8 

10 

100 

15 6 

25 

625 

11 8 

10 

100 

14 6 

24 

576 

9.9 

9 

81 

14 0 

24 

576 

11.6 

9 

81 

16 2 

24 

576 

11 8 

9 

81 

15 8 

24 

576 

12 3 

8 

64 

15 5 

22 

484 

11 3 

8 

64 

14.2 

22 

484 

10 4 

8 

64 

13 5 

22 

484 

12 6 

7 

49 

13 8 

21 

441 

13.0 

7 

49 

14 2 

21 

441 

14.7 

6 

36 

16 2 

21 

441 

11 5 

5 

25 

16 2 

21 

441 

11.0 




20 

400 

12.8 




20 

400 

13.0 





* Read from published graph. This accounts for the slight discrepancy between the 
correlation we got and that reported by the author. 
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TABLE 14.3 

Regression Statistics for Protein Content of Wheat 


Variate 

Zero Order Correlations 

Sy - 315.75 
- 0.69111 

R » 0.8313 

X* Y 



X 

0.9804 -0.8063 


X} 

-0.7506 



Remainder after linear regression — (1 — r^iSy 3 ) *» 110.48 
Remainder after curved regression *• (1 — R 2 )(Sy) » 97.53 


almost like a straight line. Toward the right, however, the term in X} has 
bent the curve to practically a horizontal direction. 

It is interesting to observe the effect of curvilinearity of regression on 
the correlations. If linear regression is assumed, the correlation between 
yield and protein content is —0.8063; but when the second degree term 
is introduced, the correlation between Y and Y becomes R = 0.8313. Is 
this small increase in absolute value significant of a curvilinear population 
regression? We set up the hypothesis of linear population regression, 
calculate the sum of squares of errors of estimate at the bottom of table 
14.3, then enter the result along with its degrees of freedom (91 — 2 = 89) 
in table 14.4. In like manner, the sum of squares and degrees of freedom 


TABLE 14.4 

Test of Significance of Departure From Linear Regression 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Deviations from linear regression 
Deviations from curved regression 

89 

88 

110 48 

97 53 

1.11 

Curvilinearity of regression 

1 

12.95 

12 95** 

F - 

12.95/1.11 = 11.7 




for curvilinear regression are calculated and recorded. The reduction in 
sum of squares, tested against the mean square remaining after curvilinear 
regression, proves to be significant. The hypothesis of linear regression 
is abandoned, therefore: there is said to be a significant curvilinearity in 
the regression. 

For any given yield, say 20 bushels per acre, the expected protein con- 
tent is 


Y = 17.703 - (0.341 5) (20) + (0.004075) (20) 2 = 12.51% 
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In this way, the regression equation is useful for estimating and interpolat- 
ing. Fiducial statements and tests of hypotheses are made as in chapter 13. 

It may be thought that there is a great chasm between the rectifica- 
tion of data in the last section and the treatment of curvilinear regression 
in this. The distinction, however, is really superficial. If a three-dimen- 
sional graph were made, X and X 2 being plotted on separate axes, the 
regression surface would be a plane. In that sense the data would be 
rectified as before. It is only the limitations of the two-dimensional graph 
that make the methods seem different. 

As always in regression, either linear or curved, one should be wary of 
extrapolation. The data may be incompetent to furnish evidence of 
trend beyond their own range. Looking at figure 14.1, one might be 
tempted by the excellent fit to assume the same growth rate before the 
sixth day and after the sixteenth. The fact is, however, that there were 
rather sharp breaks in the rate of growth at both these days. To be useful, 
extrapolation requires extensive knowledge and keen thinking. 

EXAMPLE 14.8 — Fit a second degree polynomial to the data of example 14.1. 
The result is Y = 262.245 — 9.851 AT -p 0.09681 A* 2 , not a very good fit. 

EXAMPLE 14.9 — Decker and Andre (2) exposed eight lots of 500 chinch bugs 
at —6.6° C. for varying intervals of time. The resulting mortality was as follows: 


Days exposure 

1 

3 3.5 8 13 5 16 

Percentage mortality 

0.8 

36 5 8 11 6 22 8 44 


Fit a second degree ploynomial to learn if it reveals a significant deviation from linear 
regression. F = 6.09, dj. — 1 and 5, shows that the departure from linear regression, 
as tested by the fitted parabola, is just short of significance. 

EXAMPLE 14.10 — You may think that the parabola is not a suitable equation 
to fit to the data of the last example. Try putting Y equal to the logarithms of the 
mortality percentages, then fitting Y = a 4 * bX y where X is days of exposure. 

How will you compare the fit of this curve with that of the parabola? For the latter, 
the mean square deviation from second degree regression was 32.2 with d.f. = 5. 
To get a comparable figure from the logarithmic fitting, transform each Y to its anti- 
logarithm P and calculate the deviations from regression. The mean of their squares 
is 1,270, far greater than the corresponding figure for deviation from second degree 
regression. 


14.4 — Test of deviation from linear regression. In the foregoing 
section this test was made in table 14.4 after the curvilinear regression 
had been computed. If the data occur in groups, and if it is only deviation 
from linearity of regression that is interesting, the test can be made rather 
simply. The method is illustrated by use of the data in table 14.5, made 
available through the courtesy of B. J. Vos and W. T. Dawson. Four 
rates of injection were used, each double the preceding. The particular 
question under investigation was whether the regression of Y on X is 
curved. A graph will be quite sufficient to convince most people that 
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TABLE 14.5 

Lethal Dose (Coded by Subtracting 50 Units) op United States Standard 
Ouabain, Slow Intravenous Injection in Cat to Endpoint op Cardiac Stoppage 


Rate of injection, X — — — 
lUuu 

Total 


1.04575 

2.0915 

4.183 

8.366 


5 

3 

34 

51 



9 

6 

34 

56 



11 

22 

38 

62 



13 

27 

40 

63 



14 

27 

46 

70 



16 

28 

58 

73 



17 

28 

60 

76 



20 

37 

60 

89 



22 

40 

65 

92 



28 

42 





31 

50 





31 





SY 

217 

310 

435 

632 

1,594 

Number cats 

12 

11 

9 

9 

41 

V 

18 1 

28.2 

48.3 

70.2 

38.9 

SY 5 

4,727 

10,788 

22,261 

45,940 

83,716 


there is no evidence of curvilinearity. However, we shall make the test 
of significance as an illustration of the method. 

First, the total sum of squares of the lethal doses is analyzed in the 
familiar manner of table 14.6. It is clear that the chief source of variation 
is that induced by the rates of injection — the group means differ signifi- 
cantly. We are interested in learning whether the trend of the means 
departs significantly from linearity. 

For regression, Sx 2 and Sxy are required, corresponding to Sy* = 21,744 


TABLE 14.6 

Analysis of Variance of Lethal Doses of Ouabain at Four Rates of Injection 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Within rate groups (error) 

37 

5,651 

152.7 

Rates of injection 

3 

16,093 

5,364** 

Total 

40 

21,744 





384 


Statistical Methods 


Section 14.5 


in the table. The calculations differ somewhat from usual because of the 
occurrence of F in groups. Thus, 

SX = 12(1.04575) + 11(2.0915) + 9(4.183) + 9(8.366) = 148.496, 
a weighted sum. Similarly, 

&F 2 = 12(1.04575) 2 + . . . + 9(8.366) 2 = 848.628 
From these is computed, 

Sx 2 = vSX 2 - (sxy/n = 848.628 - 537.831 = 310.797 
For the sum of products of the observations, 

SAY = (1 .04575) (217) + (2.091 5) (310) + (4.183)(435) + (8.366)(632) 
= 7,982.21 

Deducting the correction term, (148.496) (1 ,594)/41 = 5,773.23, we have, 
Sxy = 7,982.21 - 5,773.23 = 2,208.98 

Finally, the sum of squares for group means is copied from table 14.6 
into table 14.7, then divided into the two parts, one attributed to linear 
regression and the other to deviations therefrom. The former is, as usual, 
(Sxyy/(Sx 2 ) = (2,208.98) 2 /310.797 = 15,700, having one degree of free- 
dom. The remainder, 393, with two degrees of freedom furnishes the 
mean square corresponding to deviations from linear regression. This 
mean square is little greater than that for error. No more than random 
sampling departure from linear regression is indicated. 


TABLE 14.7 

Analysis of Sum of Squares for Group Means 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Rates of injection (table 14.6) 

3 

16,093 


Linear regression, (Sxy) 2 / (Sx 7 ) 

1 

15,700 


Deviations from linear regression 

2 

393 

196 

Error (table 14.6) 

37 


153 

F - 196/153 « 1.3 

14,5 — Test of departure 

from linear 

regression in covariance 


analysis. As in any other correlation and regression work, it is necessary 
in covariance to be assured that regression is linear, else modifications 
must be made in the methods. In some cases, randomized blocks for 
example, it is only the error regression that matters. If the error regression 
is rectified, the adjusted means are appropriate estimates and the test of 
significance is valid. 



Section 14.5 


Curvilinear Regression 


385 


EXAMPLE 14.11 — There is available some additional information about the 
chinch bugs in example 14.9. Each of the eight mortality percentages was an average 
of five subsamples of 100 bugs. The mortality percentages in the subsamplC are as 
follows: 


Sub- 
sample 
of 100 

Days Exposure 

1 

3 

3.5 

8 

13.5 • 

16 

21 

28 

1 

0 

3 

4 

9 

21 

46 

68 

93 

2 

0 

4 

5 

21 

23 

39 

70 

100 

3 

1 

2 

6 

8 

27 

43 

66 

92 

4 

1 

5 

7 

10 

25 

47 

61 

86 

5 

2 

4 

7 

10 

18 

48 

73 

89 

Total 

4 

18 

29 

58 

114 

223 

338 

460 


Analyze the variance: 

Within dates 421.2, 32 </./., mean square 13.2 

Between dates 39,934.4, 7 dj., mean square 5,704.9 

Although some heterogeneity is indicated, the mean square, 13.2, may be taken as the 
estimate of error against which to test any of the mean squares in example 14.9. For 
example, F « 32.2/13.2 = 2.44, dj. » 5 and 32, shows that the departure of these 
data from second degree regression is not significant. 

EXAMPLE 14.12 — Swanson and Smith (15) determined the total nitrogen content 
(F grams per 100 cc. of plasma) of rat blood plasma at nine ages ( X days). 


Number 
of Rats 

Age 

25 

37 

50 

60 

80 

100 

130 

180 

360 

1 

0.83 

0.98 

1 07 

1.09 

0 97 

1.14 

1.22 

1.20 

1.16 

2 

0 77 

0.84 

1.01 

1.03 

1.08 

1.04 

1.07 

1.19 

1.29 

3 

0 88 

0 99 

1.06 

1.06 

1.16 

1 00 

1.09 

1.33 

1.25 

4 

0.94 

0.87 

0.96 

1.08 

1.11 

1.08 

1.15 

1.21 

1.43 

5 

0.89 

0.90 

0.88 

0.94 

1.03 

0.89 

1.14 

1.20 

1.20 

6 

0.83 

0.82 

1.01 

1.01 

1.17 

1.03 

1.19 

1.07 

1.06 

7 

0.84 

0.95 

0.93 


0.98 

1.08 

1.19 

1.13 

1 29 

8 

0.75 

0.92 

1.07 


0.99 

0.98 

1.14 

1.12 

1.25 

9 

0.67 

0.87 

1.03 



0.98 

1.13 


1.23 

10 

0.70 


1.13 



1.10 

1.06 


1.22 

11 

0.77 


1 05 




1.11 


1.17 

12 

0.76 


0.96 




1.64 



13 

0.75 


1 01 




1.29 



14 

0 78 


1 01 




1.09 



15 ! 

0.76 


0.94 




1.14 



16 

0.86 


1.03 




1.12 



17 

0.78 


1 06 




1 29 



18 

0 84 


1.08 




1.28 



19 

0.85 


1.09 




1.10 



20 

0.83 


0.93 




1.07 



21 

0.85 


0.95 




1.10 



22 

0.85 









23 

0.83 









24 

0.83 









25 

0 70 
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Verify the following: 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Total 

118 

2.8686 


Ages 

8 

2.3076 


Individuals in age groups (error) 

110 

0.5610 

0.0051 

Ages 

8 

2 3076 


Linear regression 

1 

1.4851 


Deviations 

7 

0.8225 

0.1175** 


For illustration we shall return to the sugar beet data of table 12.13. 
An agronomist, studying the findings of the covariance analysis, expressed 
surprise that the differences among the treatment mean weights per root 
were not significant. One phase of the question was developed in example 
12.10. Another leads to a test of the linearity of the regression. The method 
is the same as that of section 14.3. The squares of the numbers of beets in 
the plots, X 2 , considered as a second independent variate, are used with 
the first powers, X , and the dependent yield, F, in the multiple covariance 
analysis of table 14.8. The squares are expressed in units of 1,000, 
(183) 2 = 33,489 being entered in the data sheet as X 2 = 33. Corrections 
for origin are made in the usual fashion (section 13.10). In section 12.7 
some of the preliminary calculations are described in detail. 

TABLE 14.8 

Analysis of Variance and Covariance of Numbers of Beets (X), the Square (X*) 

and Yield ( Y ) 


Source of Variation 

Variable 

X 

i 

X 2 

Y 

Total 

Blocks 

Treatments 

Error 

X 

152,158.00 
7,472 57 
116,020 33 
28,665.10 

88,165 00 
4,542 00 
66,953 67 
16,669 33 

4,163 69 
- 116.56 

3,598 05 
682 20 

Treatment -f error 


144,685.43 

83,623 00 

4,280.25 

Total 

Blocks 

Treatments 

Error 

X 2 


51,704 48 
2,782 19 
38,779.48 
10,142.81 

2,383 94 
- 71.88 

2,065 57 
390.25 

Treatment -f error 



48,922 29 

2,455 82 

Total 

Blocks 

Treatments 

Error 

F 



142.4022 

6 3134 
112 8562 
23.2326 

Treatment 4* error 




136.0888 
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The sums of squares and products for error lead to the multiple corre- 
lation, ip « 0.7029. The product of Sj? — 23.2326 (in the line for 
error) by (1 — ip) = 0.2971 is the sum of squares of errors of estimate 
from quadratic regression, entered in table 14.9 along with a number 


TABLE 14.9 

Test of Significance of Deviations From Error Linear Regression 

Sugar Beets 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Deviations from linear regression, 5>< 2 (1 ~r 2 ) 
Deviations from quadratic regression, 5)^(1 — F 2 ) 

29 

28 

6.9969 

6.9024 

0.2465 

Reduction due to quadratic regression 

1 

0,0945 



from table 12.14 giving similar information about deviations from linear 
regression. The sum of squares (d.f. = 1) credited to curvilinearity of 
regression, 0.0945, is less than the error mean square. Certainly, departure 
from linearity of regression is negligible. 

In some other example similar to this the regression might turn out to 
be significantly nonlinear. You would then wish to test the significance 
of the differences among means adjusted by the curvilinear regression. 
Turning back to table 14.8, you would make a second multiple regression 
of the treatment + error sums of squares and products, resulting in R 2 
= 0.9343. The test of significance is then carried through (table 14.10) 
in the familiar way. In the present example, since the degree of freedom 
for second degree regression does not carry an average amount of sum of 
squares, the F-value is smaller than that of table 12.14. One would use 
this test only if there were a significant reduction in sum of squares 
attributable to curvilinearity of regression. 

TABLE 14.10 

Test of Significance of Differences Among Mean Yields Adjusted 
for Curvilinear Regression 
Sugar Beets 



Sum of 
Squares 
& 


Errors of Estimate 

Source of Variation 

1 - R* 

Sum of Degrees of Mean 
Squares Freedom Square 

Treatment 4* error 
Error 

136.0888 

23.2326 

0.0657 

0.2971 

8.9410 34 

6.9024 28 0.2465 

Treatment 



2.0386 6 0.3398 


F - 0.3398/0.2465 « 1.38 
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EXAMPLE 14.13 — The test for curvilinearity in multiple regression is similar 
to those in the last two sections. Emmert (4) measured the nitrogen, TV, and phos- 
phorous, P, (parts per million) in the steins of potato plants on June 1 5, 1 935, then 
recorded the associated yield in pounds, Y, from each of 37 plots. The multiple regres- 
sion of Y on TV and P had R ** 0.61 86. Since there was some evidence of curvilinearity, 
the variables TV * and F 1 were added. The multiple regression of Y on TV, P, TV* and P* 
yielded R « 0.6522. The increase in R indicates some departure from linearity. The 
D-primes showed that this was due to the term in P % only, the TV* term yielding no 
additional information. The latter term being dropped, therefore, the regression of 
Y on TV, P and P* gave R » 0.6522 as before. Since only a test of significance is desired, 
it isn’t necessary to multiply each (1 — R *) by £y* The test is as follows: 


Source of Variation 

Degrees of 
Freedom 

1 - P* 

Factor for Test 
of Significance 

Regression of Y on TV and P 

34 

0 6174 


Regression of Y on N, P and P* 

33 

0.5747 

0.0174 

Curvilinearity 

1 

0 0427 

0.0427 


F - 0.0427/0.0174 « 2.45, dj . - 1 and 33 


F being far short of its 5% value, the departure from linearity is attributable to acci- 
dents of sampling. 

The data are below. 


TV 

P 

P* 

Y 

TV 

P 

P* 

Y 

139 

195 

38 

33 

330 

260 

68 

46 

143 

146 

21 

43 

380 

134 

18 

60 

147 

120 

14 

40 

436 

130 

17 

57 

154 

178 

32 

49 

436 

200 

40 

64 

162 

266 

71 

43 

470 

134 

18 

55 

182 

200 

40 

27 

476 

160 

26 

54 

186 

260 

68 

55 

555 

155 

24 

37 

200 

240 

58 

51 

556 

188 

35 

42 

208 

170 

29 

45 

626 

125 

16 

45 

213 

195 

38 

41 

700 

146 

21 

63 

233 

146 

21 

57 

750 

178 

32 

52 

257 

160 

26 

38 

834 

188 

35 

46 

264 

146 

21 

48 

834 

178 

32 

56 

264 

146 

21 

39 

908 

105 

11 

65 

264 

160 

26 

40 

1125 

160 

26 

46 

264 

250 

62 

38 

1200 

122 

15 

58 

265 

175 

31 

52 

1250 

140 

20 

80 

286 

114 

13 

49 

1625 

150 

22 

70 

312 

155 

24 

51 






Y - 0.0170TV - 0.440P + 1.1 OP* + 82.5 


14.6 — The fitting of orthogonal polynomials. A special method 
has been perfected by Fisher (5) for fitting the polynomial, 

Y - a + bX+'cX* + dX*+ ... 

This is a flexible curve, adaptable to fitting a wide assortment of data. 
It includes the straight line and the parabola as special cases. Fisher’s 
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method is presented both for its ready utility and for its theoretical value. 
We shall illustrate only the case in which the values of X are spaced at 
unit intervals, each accompanied by a single value of Y. If X occurs in 
equally-spaced intervals different from unity, it may be coded to unit 
intervals by dividing each value by the common interval. In data having 
X unequally spaced or having more than one Y for some or all of the ^?s, 
use the method of section 14.4. 

One advantage of the present method is that the fitting can be carried 
through conveniently in successive stages, the success of fitting terms of 
higher and higher degree being observed and tested for significance at 
each stage. The fitting will be terminated with the fourth degree term in 
the example below. Those who wish formulas for higher degree terms may 
infer them readily with the assistance of the footnotes. Professor Fisher 
gives general formulas in his book. 

For illustration, the chick embryo data of table 14.1 will be used. The 
dry weights are copied in the first column of table 14.11. The sum, 

TABLE 14.11 


Sums and Derived Quantities for Fitting a Polynomial to Chick Embryo Data 



Y 

2 


3 


4 

5 


0 029 

0 029 

0 

029 


0 029 

0 029 


0 052 

0 081 

0 

110 


0 139 

0.168 


0 079 

0.160 

0 

270 


0.409 

0 577 


0 125 

0 285 

0 555 


0 964 

1.541 


0.181 

0 466 

1 

021 


1 985 

3 526 


0.261 

0 727 

1 

.748 


3 733 

7.259 


0 425 

1 152 

2, 

.900 


6.633 

13 892 


0.738 

1.890 

4 

790 


11.423 

25 315 


1 130 

3 020 

7 

810 


19 233 

44.548 


1 882 

4 902 

12. 

.712 


31.945 

76 493 


2.812 

7 714 

20 

426 


52.371 

128.864 

S\ 

-7.714 

S% - 20 426 Si - 52 

371 S< 

- 

128.864 Si 

- 302 212 

a 

- 0.701273 

4 = 0 309485 c 

« 0 

183115 

d 

- 0 128735 

e =*0.100637 

a ' 

- 0 701273 

b' - 0 391788 c' 

- 0 

139048 

d f 

- 0 031838 

-0 003066 

A 

* 0 701273 

B = 0.235073 C 

- 0, 

.0463493 D 

- 0.00619072 E -0.00038325 


Si = 7.714, is divided by n = 11, the number of items in the set, to get 
the mean, a = 0.701273, as specified in the first of the following formulas: 



0K2) 

n{n + 1) 


{Si) 


(1)(2)(3) 
n(n + 1 )(n + 2) 


{Si) 


C — 
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e = 


(1)(2)(3)(4) 


(S t ) 


n(n + 1) (« + 2)(n + 3) 

(1 ) (2) (3) (4) (5) 

n(n + l)(n + 2 )(n + 3)(n + 4) 


(St,) 


The remaining formulas will be used later. 

The mean, a, is copied in the next line as a', the two being equal 
according to the first of the next set of formulas: 
a! =■ a 
b‘ = a - b 
c' = a — 36 + 2c 
d! « a - 6b + 10c - 5 d 
e' = a - 103 + 30c - 35 d + 14c* 

Again, A — a' — a in the first of the third set: 

A = a' 


B = 


6 


(*') 


C = 


£> = 


£ = 


30 


(n - l)(n - 2) 


(O 


140 


(d') 


(n - l)(n - 2 )(n - 3) 

630f 

(n - 1)(« - 2 )(n - 3)(n - 4) 


(O 


Finally, the sum of squares of deviations from mean is got by sub- 
tracting from SY 2 = 1 3.577730 the correction term nA 2 — (11) (0.701273)* 
= 5.409622, set down as the first of these quantities: 


nA* = 


(SY)* 


n(n* - 1) 

12 


(&) 


* The next three of these equations are 

« a - 15* + 70c - 140 d + 126* - 42/ 

/ - a - 21* + 140c - 420 d -f 630c - 462/ + 132* 

- a - 28* + 252c - 10504 -f 2310c - 2772/ + 1716* - 429* 
t The numerators of the next three terms are 2,772, 12,012, 51,480. 
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n(n 2 — 1 )(n 2 — 4) 
180 


(CP) 


n(n 2 - t)(n 2 - 4)(n 2 - 9) 

2,800 ' ' 

»(* ~ l)(« a ~ 4)(» 2 - 9)(* 2 - 16) 

44,100* ( ’ 

The remainder, Sy 1 = *ST 2 — nA 2 = 8.168108, is the end of the first stage 
in the fitting. A horizontal straight line, Y = 0.701273, has been fitted 
to the data. The sum of squares of deviations from this line is Sy 1 . 

The second stage starts with the sums in column 2 of the table. After 
the first item is copied at the top of the column, the successive sums are: 

0.029 + 0.052 = 0.081 
0.081 + 0.079 - 0.160 
0.160 + 0.125 = 0.285, etc. 


These are readily run up on an adding machine. The last sum, 7.714, 
is the same as Si, a convenient check. The sum of column 2 is S'* = 20.426. 
Going through the sets of formulas again, there result: 

b = (20.426) = 0.309485 

b'= 0.701273 - 0.309485 = 0.391788 

B = ^ (0.391788) = 0.235073 

S(Y - f) 2 = 8.168108 - — (0.235073) 2 

= 8.168108 - 6.078525 = 2.089583 


The reduction in sum of squares due to fitting the linear regression is 
6.078525 with 1 degree of freedom. This may be tested for significance 
by comparing it with the remainder, 2.089583, having 9 d.f. The mean 
square for error is, therefore, 2.089583/9 = 0.232, yielding F = 6.079 
/0.232 » 26.2, a highly significant value for d.f . = 1 and 9. 


The denominators of the next three terms are 698,544, 11,099,088, 176,679,360. 
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The third stage involves fitting the second degree term. Starting with 
S 9 - 52.371, 


(1)(2) (3) 


(52.371) = 0.183115 


(11)(12)(13) 

c'= (0.701273) - 3(0.309485) + 2(0.183115) = 0.139048 
30 


C = 


(10) (9) 


(0.139048) = 0.0463493 


5(7 - P) 2 = 2.089583 - (0.0463493) 2 

= 2.089583 - 1.843205 = 0.246378 

The mean square for testing is now 0.246378/8 = 0.0308, so that 
F = 1.843/0.0309 = 59.8, which with d.f. = 1 and 8 is highly significant. 
In the next stage, the third degree term is fitted with this result: 

5(7 - t y = 0.246378 - 0.236757 = 0.009621 

This yields F = 0.237/0.00137, d.f. = 1 and 7, another highly significant 
reduction. 

One would be justified in stopping at this point. Since 1 — /? 2 is the 
ratio of the remaining sum of squares to Sy 2 , we have 

1 - R 2 = 0.009621/8.1681 = 0.001178 
, R = V0.998822 = 0.9994, 

about as high a correlation between estimated and actual values as one 
could wish. However, for two reasons the fitting will be carried one stage 
further: to illustrate the method, and because the third degree polynomial 
exhibits certain snakelike curves that make it unsuited to the chick embryo 
data. 

Taking the last of the successive sets of formulas, with S& = 302.212, 
one arrives at 

S(Y - ?) 2 = 0.009621 - 0.006049 = 0.003572, 

so that F = 0.006049/0.000595 = 10.2, a significant, but not highly 
significant, reduction. 

For many purposes, the job is complete. Some, though, may wish to 
plot the curves, and some may require the equations. Those who need 
the graphs but not the equations should follow the procedure of the next 
few paragraphs. If equations are necessary, skip to the sixth paragraph 
below. 

In table 14.12 are entered some polynomials with corresponding co- 
efficients which enable one to build up the desired values of Y by adding 
sets of differences to a terminal value. The coefficients depend on the size of 
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TABLE 14.12 

Coefficients and Polynomials for Terminal Values and Differences for Fitting 
Terms up to the Seventh Degree 


Degree 

of 

Poly- 

nomial 

Coefficient Depending 
on Sample Size 

Polynomial Ending With Term of Degree 

0 1 2 3-4 5 6 7 

0 

(Terminal Value) 

a' + 36' + 5c' 4- 7 d' + V + 11/' + 13/ + 15V 

1 

6 

n - 1 

V + 5 c' +14 d' +30c' + 55/' + 91/ +140 A' 

2 

60 

(a - l)(a - 2) 

c' 4- Id* +27*' + 77/' +182/ +37 Sh* 

3 

' 

— 840 

(a — 1) (a— 2) (a — 3) 

d' + 9c' + 44/' +156/ +450 A' 

4 

15,120 

(n - 1) ... (n - 4) 

c' + 11/' + 65/ +275V 

5 

-332,640 

(a - 1) ... (a - 5) 

f' + 13/ + 90 V 

6 

8,648,640 

(a — 1) ... (a — 6) 

/ 4* 15 h* 

7 

-259,459,200 
(a - 1) ... (a - 7) 

b! 


the sample, while the polynomials are determined by the degree of the 
last term fitted. As a first illustration of the method we shall calculate 
points on the best fitting straight line. 

Since the last term now to be considered is of degree bne, the terminal 
value is (a' + 3 b’) = 0.701273 + 3(0.391788) = 1.8766. The first differ- 
ence is 

- <+ (t,) - - orby (03,1788 > - -° 2351 

By successive additions of this first difference to the terminal value, 1.8766, 
one gets the successive values of F, beginning with the largest correspond- 
ing to X = 16: 

1.8766 + ( — 0.2351) = 1.6415 
1.6415 + (-0.2351) » 1.4064 
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X Polynomial Value, Y First Difference 


6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 Terminal value 


-0.474 

-0.239 

-0.004 

0.231 

0.466 

0.701 

0.936 

1.171 

1.406 

1.642 

■ 1 .8766 -0.2351 


In the calculating machine all the four decimals are carried, but the last 
one is deleted in the column of polynomial values. These values are 
plotted in figure 14.3. The straight line is a poor fit, of course, but it does 
pass through the point (x, y)\ that is, (11, 0.701). fc. ‘ 

Next, let us calculate the ordinates for the best fitting parabola. From 
the table, ending with degree 2: 



Fio. 14.3 — Graphs of polynomials of first, second, and fourth degree fitted to chick 
embryo data of table 14.1. 
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Terminal value = a f + 3 V + 5 c f 

= 0.701273 + 3(0.391788) + 5(0.139048) = 2.57188 

First difference ( b ' + 5c') 

n — 1 

= - [0.391788 + 5(0.139048)] = -0.65222 


Second difference 


60 

(n - l)(n - 2) 


(O 


60 

(10) (9) 


(0.139048) = 0.092699 


The calculations are carried through like this: 


X 

Y 

First Difference 

Second Difference 

6 

0.222 

0.2748 


7 

0.039 

0.1821 


8 

-0 050 

0.0894 


9 

— 0.047 

-0.0033 


10 

0 049 

-0.0960 


11 

0.238 

-0.1887 


12 

0.519 

-0 2814 


13 

0.893 

-0.3741 


14 

1 360 

-0 4668 


15 

1.920 

-0 5595 


16 

Terminal value « 2 . 57 1 88 

-0 65222 

0.092699 


The second difference is added successively to the bottom first difference, 
0.092699 + (-0.65222) = -0.5595, 

and to each of the sums above it. Each first difference, starting with the 
last, is added to a value of Y to get the one next above. Thus, 

- 0.65222 + 2.57188 = 1.920 

- 0.5595 + 1.920 = 1.360, etc. 

Y is plotted against X to make the graph of the parabola. 

Passing to the fourth degree polynomial, the terminal value and differ- 
ences are: 

Terminal value = a' -f 3 b' + 5c' + Id' + 9c' - 2.822337 
6 

n - 1 


First difference = — 


( b ' + 5c' + 14/ + 30c') = -0.974844 
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Second difference — 


60 


Third difference 


Fourth difference = 


(n - l)(n - 2) 

-840 

(« - l)(n - 2 ;)(« - 3) 

15,120 


(c’ + Id' + Tie') = 0.296464 
(</' + 9e') = -0.069337 
(/) = 0.009198 


(n — l)(n — 2 )(n — 3)(n — 4) 

These are entered in the last line of table 14.13. The third differences 


TABLE 14.13 

Calculation of the Fourth Degree Polynomial Values for Chick Embryo Data 


X 

Y 

First 

Difference 

Second 

Difference 

Third 

Difference 

Fourth 

Difference 

6 

0.026 





7 

0 056 

— 0.0302 




8 

0 086 

-0 0295 

-0 00069 



9 

0.119 

-0.0337 

0.00426 

-0.004951 


10 

0.171 

-0.0521 

0 01841 

-0 014149 


11 

0.265 

-0 0939 

0 04176 

-0 023347 


12 

0.434 

-0 1682 

0.07430 

-0 032545 


13 

0.718 

-0.2843 

0 11605 

-0 041743 


14 

1.169 

-0.4512 

0.16699 

-0 050941 


15 

1 847 

-0.6784 

0.22713 

-0.060139 


16 

2.8223 

-0 97484 

0 296464 

-0 069337 

0.009198 


result from successive additions of 0.009198. The second differences are 
calculated thus: 

-0.069337 + 0.296464 = 0.22713 
—0.060139 + 0.22713 = 0.16699, etc. 

All the significant figures are carried in the machine, as before, but one 
is dropped in copying. The first differences are got in a similar manner, 
then the polynomial values. These last, plotted on the graph, specify the 
fourth degree curve. It passes so nearly through the data points that the 
discrepancies cannot be depicted. 

The information now at hand is all that is needed in most cases. The 
goodness of fit is determined both by R and tests of significance of the 
reductions in sum of squares. The graph not only gives visual represen- 
tation but serves for interpolation accurately enough for ordinary pur- 
poses. There would seem to be little incentive to derive the equation of 
the curve. 

If the equation is desired, omit the computation of the polynomial 
values in the preceding few paragraphs. After the equation is set up as 
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described below, points on the graph can be computed at will. The 
equation is of the form, r ( 

Y = A + BXi + CX* + . . . 

where A, B, C, . , . are the quantities last calculated in table 14.11, and 
Xi, X 2 , etc., are themselves polynomials in X. It is in this part of the 
computation that X is assumed to occur at unit intervals. If the actual 
intervals are equal but not unity, as for example 30 days or 5 years, they 
may be coded easily by division. Practically, they are simply rewritten 
as 1, 2, 3, ... , the new unit being 30 days or 5 years. 

One last set of formulas is needed: 


X x = X - x 


X 2 = X x 2 


n 2 - 1 
12 


A', = A7 - 2?L_Z (A’i) 


A. = A, 4 - 


20 

3 n 1 - 13 
14 


(AV) + 


3(n 3 - l)(n s - 9) 
560 




18 


1,008 


From table 14.1, x = 11 days and n = 11. Hence, 

Xi = X — 11 

Xi — {X — ll) 2 - 10 = A’ 2 - 22X+ 111 

A, = (X - ll) 3 - 17.8(A - 11) = A 3 — 33A’ 2 +345.2A'— 1,135.2 

X t = (X - ll) 4 - 25 (A - ll) 2 + 72 

= A' 4 - 44A 3 + 70 IX* - 4,774A + 11,688 

Any equation up to the fourth degree may be written from the data now 
before us. As examples, the equation of the best fitting Straight line is, 

¥ = A + BX\ = 0.701273 + 0.235073(A- 11) = 0.2351A -1.885, 
while that of the fourth degree polynomial is, 

Y = A + BXi + CX t + DXi + EXi 
= 0.701273 + 0.235073(A - 11) + 0.046349(A 2 - 22A 
+ 111) + 0.0061907 (A' 3 - 3 3 A 1 + 345.2A - 1,135.2) 

+ 0.00038325 (A 4 - 44A 3 + 701A 2 - 4,774A + 11,688) 

= 0.712 - 0.4772A+ 0.11071A 2 - 0.010672A 3 + 0.0003832A 4 

By substituting the several values of A days in this equation, the poly- 
nomial values may be calculated and plotted for the graph. 
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The student well may question the advisability of fitting curves. A 
stupendous amount of time has been wasted in ill-advised curve fitting. 
Only when the end in view is clear should the task be undertaken. Often 
a graph of the data points is sufficient. Represent them by small circles 
or heavy dots. If desired, they may be connected by light line segments. 



Avoid drawing “eye-fitted” curves. They are highly subjective and are 
apt to be misleading to both the perpetrator and the victim. Interpolation 
with these links stands a better chance of being good than does estimating 
by means of even the most artistic curves. If you know enough about the 
law to say that there ought to be a curve, then you should be able to write 
an equation expressing the law and fit it to the data. If you wish to evalu- 
ate a correlation when regression is nonlinear, or if you wish to test signifi- 
cance, then suitable curved regressions must be fitted. Occasionally, 
fitted curves are required for interpolation. In many cases, graphical 
representation of the data is sufficient. 

EXAMPLE 14.14 — The following five points lie on the parabola, Y 9+2AT— X 1 : 
(0, 9), (1, 10), (2, 9), (3, 6), and (4, 1). Using Fisher’s method, show that the sum of 
squares remaining after fitting the second degree term is zero. Calculate the polynomial 
values. These will be identical with the pairs of values given above. When you plot 
them, the points will lie exactly on the parabola whose equation is given. Derive this 
equation by the use of the formulas up to the second degree. 

EXAMPLE 14.15 — Here are some points on the cubic, Y *= 9X — 6.Y* -f X *: 
(0, 0), (1, 4), (2, 2), (3, 0), (4, 4), (5, 20). Carry through ail the computations by 
Fisher’s method, including the best fitting straight line and parabola. There will be 
no remaining sum of squares after fitting the third degree term and the polynomial 
values at that stage of the fitting will be exactly those you start with. Plot the best 
fitting line and the best fitting parabola and derive their equations. 

EXAMPLE 14.16 — Reed and Holland (13) reported the following average heights 
of sunflowers: 


Week 

1 2 3 4 5 6 

7 8 9 10 

11 

12 

Height 

18 36 68 98 131 170 

206 228 247 250 

254 

254 

Polynomial 





values 

15.0 38.9 68.0 100.6 134.6 168.0 

198.9 225.5 245.6 257.5 

259.1 

248.4 


The polynomial values are those for the cubic. Verify and plot. The remaining sum of 
squares after fitting the cubic is 211.03. Calculate the equation of the polynomial, 
then verify by duplicating the polynomial values given above. 

EXAMPLE 14.17 — The fit of the cubic to the sunflower heights is not entirely 
satisfactory, since it turns downward in the twelfth week. A great deal has been written 
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about a more suitable curve, the logistic , or the curve of autocatalytic growth. An 
excellent account, including bibliography, is given by Pearl (11). 

EXAMPLE 14.18 — Those who deal with dosage-mortality and time-mortality 
curves will find an excellent treatment in three articles by Bliss (1). 
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Chapter 15 

INDIVIDUAL DEGREES OF FREEDOM 


15.1 — Introduction. Mean squares for single degrees of freedom were 
implied in chapters 2 and 4, where two means were compared; and in 
chapter 6, where the regression coefficient was studied. The .F-test was 
applied to corresponding null hypotheses in chapters 10 and 11. Further- 
more, in the latter two chapters the subdivision of degrees of freedom for 
treatments led to the evaluation of one-degree comparisons (section 11.6, 
for example). Some extension and simplification of these earlier methods 
will be made in the present chapter. 

Experiments may be designed, often rather easily, for individual com- 
parisons among the treatments. If so, the sum of squares for n treatments 
may be subdivided into n - 1 parts, each corresponding to a single degree 
of freedom; and these parts may be tested individually against experi- 
mental error. The result may be a notable increase in the information 
furnished by the experiment. Several such experiments will be presented. 


15.2 — A short method of calculating mean squares. Several times 
already we have had occasion to apply a special case of this short method. 
An instance occurred in section 10.5 where the mean square for the 
difference between the means of two lots of hormone-injected chicks was 
got from the sums in this way: 


(1,067 - 616) 2 
22 


9,245.5 


It should be fixed in mind that this method applies only if the two sums 
are made up of the same numbers of individuals; that is, ki = k 2 = k. It 
is informative, also, to know that the divisor has two factors, the formula 
being 

(Si - S 2 y 
2k 


The common size of the two groups is k , whereas the factor, 2, is a special 
case of a theorem to be used below: 2 is a consequence of the fact that the 
variance of differences is twice that of the corresponding individuals (sec- 
tion 11.5 contains an application). 

Even if there are more than two treatments, experiments are often 
designed for individual comparisons. An example is found in section 10.12 
where the concentration of carbon dioxide in soil air was studied. It is 

■ [ 400 ] 
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clearly appropriate to compare the mean concentration in the treated 
plots, 

(86 + 78)/2 * 82,' 

with that in the check plot, 99. The comparison is most readily made by 
use of the sum, 86 + 78 = 164, instead of the mean. First, let us employ 
the familiar methods for calculating a sum of squares (table 10.14, fourth 
step). In each sum, k is the number of items involved. The computation 
set out in the table herewith leads to the sum of squares of deviations from 


Number of 

Plots 

Number of 
Observations 

Sum of 
Observations 

(Sum )'/k 

1 

9 

99 

1,089.00 

2 

18 

164 

1,494.22 

3 

27 

263 

2,583.22 

Correction * (263)*/27 



2,561.81 

Sum of squares 



21.41 


mean, 21.41. For the test of significance, this mean square is compared 
with the error mean square, 48.82, in table 10.20. Since the figure for 
the present individual comparison is less than error, nonsignificance is 
obvious. Nevertheless, you will be struck by the large size of the mean 
square in this comparison as contrasted with the mean square for 2 d.f. in 
the table 10.20. Most of the variation lies between the treated plots and 
check. This feature will be examined further in the next section. 

Now for the new method of computation. If you carry through the 
processes of the foregoing paragraph with literal symbols instead of the 
Arabic numerals, you will reach this formula for the sum of squares: 

(2gi - S 2 - S 3) 2 
6 k 

where £1 = 99, £2 = 86, £3 = 78, and k = 9. By use of this formula the 
calculation is very easy: 

[2(99) - 86 - 78? _ 

6(9) 

Formulas for individual comparisons, such as this one, have led to a 
new manner of thinking about them. One soon learns to contrast the 
sum, S 2 + with twice the compared value, Si. Essentially this is the 
same as subtracting the mean, (S 2 + 5a)/ 2, from S\. What about the 
figure 6? You recall a similar figure 2, in the formula for two groups of 
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equal size. There is an easy rule for arriving at this factor in the denomi- 
nator. First, write down the comparison you intend to make: 

Sy 62 or 2S\ — S 2 S$ 


Then add the squares of the coefficients of the letters: 

12 + (_D* « 2, or (2)2 + ( 1)2 + ( 1)2 - 6 


You will find that this rule always leads to the correct value of the co- 
efficient of k in the denominator. 

A second example involving an experiment with more than two groups 
is found in example 11.17, where four seed treatments for soybeans were 
tried along with no treatment. The numbers of plants emerging in 5 
blocks were, 


Untreated 


Arasan 

469 

Spergon 

459 

Semesan, Jr. 

467 

Fermatc 

471 

Average of treated 



446 


466.5 


Instead of comparing the untreated with the mean of the treated, the new 
scheme is to use a difference which is 4 times as great: 

(469 + 459 + 467 + 471) - 4(446) = 1866 - 1784 = 82 


Symbolically, this may be written, 

S2 + Sz + £4 + £5 — 4-5*1 
the sum of the squares of the coefficients being 

l 2 + 1* + l 2 + l 2 + 4 2 = 20 


Since k = 5, the mean square is, 

(82)2 

(20X5) 


67.24 


This result checks with that of the longer method. 

Please notice that in every comparison the sum of the coefficients is 
zero. This reminds you of the theorem that the sum of the deviations from 
mean is zero. Also worthy of note is the fact that the quantity squared in 
the comparison numerator is always the difference between two sums. 


EXAMPLE 15.1 — For the chick-hormone data, the comb weights of the 2 lots 
(11 chicks each) averaged 97 and 56 mg. Calculate the mean square from the formula, 


k(fi - x t )* 
2 


EXAMPLE 15.2 — The mean numbers of plants emerging from the plots of soy- 
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beans were £i - 446/5 - 89.2, £% = 93.8, £$ - 91.8, 5* « 93.4, and & « 94.2. Cal- 
culate the mean square from the formula, 

H£i + t » + *4 + *1 - 45 ,) 2 
20 

Thus, if means instead of sums are used, A is a multiplier instead of a divisor. The sum 
of the squares of the coefficients remains in the denominator. 

EXAMPLE 15.3 — Each of the following sums is based on 5 items: Si » 20, 
Si «* 30, Si « 35, and Si « 40. Calculate the mean squared for the comparisons: 

1. -f- iS'a — ^S 1 ! — Si. Answer 31.25 

2. 3-Si — — •S’j — Si. Answer 33.75 

EXAMPLE 15.4 — In the first comparison of the preceding example you will get 
the same answer if you consider £, ■* 20 + 30 «* 50 and Si * 35 4- 40 ~ 75 with 
k «= 10 instead of 5, then compare S, with S 2 . Try it. 

EXAMPLE 15.5 — Each of the following is the sum of 4 items: Si ** 12, .S’* =» 18, 
Si *= 20, Si ■■ 30, and Si » 32. The mean square for the comparison, 

25*1 4- 2Si + ZSi - 3^ ~ 3 S>, 

will be found to be 61.63. Here you are comparing the mean of 3 groups with that of 2 
EXAMPLE 15.6 — Derive the two formulas for mean squares, 

k{£ 1 - f ,)* _ J {Si - S t )* 

~2 and 2k 

where £\ Si/k and x% *= Si/k. 

EXAMPLE 15.7 — Derive the formula for the mean square, 

(2 Si - S t - Si)*/ (6k) 

15.3 — Orthogonal sets of comparisons. Professor Fisher and his asso- 
ciates have successfully maintained the thesis, early announced, that for 
each degree of freedom an appropriate sum of squares can be segregated; 
also that the total sum of squares for any group of rz -1 comparisons can be 
partitioned into w -1 parts, each fit for testing one of the set of independent 
individual comparisons. 

For an illustration, let us return to the first part of the foregoing section. 
One of the two degrees of freedom for plots, table 10.20, has been furnished 
with a suitable sum of squares, 21.41. The other may be associated with 
the comparison of concentration of carbon dioxide in the air of the two 
treated plots. The two comparisons may be symbolized thus: 

between treated plots, S 2 — £3 

check vs. treated plots, 2S\ — St — S* 

In the more concise form, the comparisons are: 


Si 

Si 

Si 

0 

-hi 

-1 

2 

-1 

-1 
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Notice two important facts. Not only is each sum of coefficients zero, 
analogous to the sum of deviations from mean; but the sum of the products 
of the corresponding coefficients is zero, 

( 0 ) ( 2 ) + ( 1 )( — 1 ) + ( — 1 )( — 1 ) ** 0 , 

analogous to zero covariance. The absence of correlation between the 
comparisons is the reason they are designated orthogonal . 

Completing the calculations, the comparison between the treated plots 
leads to the sum of squares, 

< 86 - 78 >‘ , 3 56 
(1* + l’)(9) 

Thus the difference between the means for the two treated plots is ob- 
viously not significant. Finally, the sums of squares for the two indepen- 
dent comparisons add to 

3.56 + 21.41 « 24.97, 

which is the sum of squares for the two degrees of freedom for plots in 
table 10.20. This addition theorem is the same as that characterizing 
sums of squares in analysis of variance. We are merely carrying to its 
conclusion the process begun in chapter 10, the matching of sums of squares 
with degrees of freedom. 

You may ask the reason for choosing the particular set of individual 
comparisons presented above. Clearly other sets might be selected. The 
check plot might have been compared with one of the treated, and these 
in turn with the second treated. Theoretically, there is no limit to the 
number of sets of comparisons that may be devised. For practical pur- 
poses, however, the comparisons must be meaningful, ordinarily planned 
as part of the experiment. Extra amounts of information are available 
from experiments designed for the utilization of the individual compari- 
sons. Let us examine one of them. 

The cooperative experiment on the effect of fertilizer, example 11.29, 
is of the type known as factorial. This design will be discussed more at 
length in section 15.9. Here, we notice that the 3 degrees of freedom for 
treatment may be identified with the following comparisons, each sum 
being composed of 16 sampling units: 


Treatment 

Sum (decagrams) 

Check 

291 

N 

313 

P 

312 

NP 

314 

Differ- 

ence 

Mean 

Square 

JV vs. no N 

— 

+ 



+ 

24 

9.00 

P vs. no P 

— 


+ 

+ 

22 

7.56 

Interaction 

4- 

— 


+ 

"20 

6.25 

Sum of squares 



22.81 
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The comparisons are such that the sum of the coefficients in each is 
zero and that the sum of the products in each pair is zero. It is to be 
expected, then, that the sum of the three mean squares will be the same 
as the sum of squares, 22.81, in the original analysis of variance. The 
coefficients of interaction are each the product of those above in the two 
main effects. As may be seen from the signs, interaction compares the 
yields of plots having both fertilizers and neither with those having one 
kind only. 

Each mean square may be tested against experimental error by use of 
F: since all are smaller than error, 31.78, no more than sampling variation 
is indicated. 

As foreseen in section 10.4, you are now able to test as many separate 
null hypotheses as there are degrees of freedom for treatments. This dis- 
closes a fascinating prospect for the design and analysis of experiments. 
Only a glimpse can be given in this text, but some references will be 
found at the end of the chapter. 


EXAMPLE 15.8 — The rat experiment reported in example 11.32 allows further 
subdivisions of the degrees of freedom for treatment. A set of othogonal comparisons 
follows: 


Comparison 

High Level 

Low Level 


Mean 

Square 

Beef 

1000 

Cereal 

859 

Pork 

995 

Beef 

792 

Cereal Pork 
839 787 

Differ- 

ence 

High vs. low 

+ 

4- 

4- 

— 



— 

436 

3,168.3 

Beef vs. pork 

+ 

0 

— 

4- 

0 

— 

10 

2 5 

Interaction with level 

4~ 

0 

— 

— 

0 

4- 

0 

0.0 

Animal vs. vegetable 

+ 

— 2 

4- 

+ 

-2 

4- 

178 

264.0 

Interaction with level 

4- 

— 2 

4- 

— 

4-2 

— 

376 

1,178 1 

(Error mean square, 214.6, dj. =* 54) 

4,612.9 


Here, the 2 d.f. for source of food have been divided into 1 for “Beef vs. Pork,** and 
1 for “Animal vs. Vegetable.** The sum of the mean squares, 264.0 4* 2.5 = 266.5, is 
the same as the sum of squares, “Source,” in the former table — £he mean square is 
266.5/2 = 133 2. A similar subdivision of the interaction has been made. 

The significant interaction of “Animal vs. Vegetable” with level is worth looking at. 
Multiply the sums by the indicated coefficients and make this 2X2 table: 



Animal 

Vegetable 

High 

1,995 

1,718 

Low 

1,579 

1,678 


It is evident that the level of intake of the animal foods (beef and pork) affected the 
rat gains more than did that of the cereal. 
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EXAMPLE 15.9 — Make a 2 X 2 table showing the interaction of level with source 
of food (animal or vegetable), but containing sums only, without multiplication by 2: 

1,995 859 

1,579 839 

Using the methods of chapter 11, calculate the interaction from this table. You get 
the same result, of course, but with more labor. 

15.4 — Nonorthogonal sets of comparisons. Many experiments are 
planned for sets of comparisons which are not orthogonal. This must not 
be considered a fault if the experiment accomplishes its purpose. The sum 
of squares for the set of comparisons will not equal that between means of 
groups, and statements about probability may not be exact, but the com- 
parisons are not lacking in interest. A few examples will suffice to illus- 
trate the situation. 

In the swine nutrition experiment of example 11.10, lot 1 was fed a 
standard corn belt ration. The plan was to compare the gains produced 
by this ration with those of each of the other lots. The three comparisons 
have the following sums of squares: 

(14.64 - 11.95)72(10) = 0.3618** 

(14.64 - 13.25)72(10) - 0.0966 
(14.64 - 16.62)72(10) = 0.1960** 

The mean square for error with 27 d.f. is 0.0231. Only the second com- 
parison is not significant, and its F-value is just short of the 5% point. 

You will observe that the sum of the three mean squares, 0.6544, is 
considerably less than the sum of squares for rations, 1 .1986. A discrepancy 
might have been anticipated by writing down symbolically the compari- 
sons with the first lot: 


1 

2 

3 

4 

+ 

— 

0 

0 

4 - 

0 

— 

0 

+ 

0 

0 

— * 


Although the sum of the coefficients in any line is zero, the sum of the 
products in each pair of lines is different from zero. In the first pair, for 
example, 

(DO) + (-D(O) + (o)(— l) + (o) (o) = l 

The three comparisons are not independent, but they do furnish the 
information for which the experiment was planned. 

Where there is such a standard ration, or a standard variety, as a 
basis for comparisons, it is usually easier to calculate the significant mean 
difference, 


(t M )(sj) - 2.052 \/2 (0 02 3 1 ) / 1 0 = 0.140 pound per day 

You can see readily that if this difference between two means were divided 
by the standard error of the mean difference, the resulting t would be 
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exactly at the 5% level for 27 d.f. Hence, the intervals from 1 .464 — 0.140 
= 1.324 down, and from 1.464 + 0.140 = 1,.604 up, include all means 
differing significantly from the standard. This method of using the i-test 
leads to the identical conclusions derived from the individual comparisons. 

The experiment reported in table 11.13 was so designed as to compare 
each humus and fertilizer treatment with another similar in all respects 
save one. These contrasts with their mean squares are: 

Straw: (230.2 - 156.7) 2 /(2)(12) = 225.09** 

P0 4 : (160.2 - 156.7)7(2)(12) = 0.51 

Lime: (164.6 - 160.2)7(2)(12) = 0.81 

Notice once more the failure of the sum of the three nonorthogonal com- 
parisons to be the same as the sum of squares for humus treatments in 
table 11.14. The three comparisons are, 


Check 

Straw 

Straw 

+ po< 

Straw -\-POi 
4 lime 

4 

— 

0 

0 

0 

4 

— 

0 

0 

0 

4* 

— 


not fulfilling the second requirement for an othogonal set. 

One could easily make up an orthogonal set of comparisons contrasting 
each added treatment with the aggregate of those previously included: 
Straw vs. check, + — 0 0 

PO4 vs. first two, + + — 2 0 

Lime vs. first three, + + + — 3 

It is questionable whether such comparisons, though satisfying the tests 
for orthogonality, have validity in this experiment. For instance, the 
third might result from the contrast of lime with PO4 or with straw or with 
check or with any combination. One must be satisfied with the informa- 
tion furnished by those comparisons which are built into the structure of 
the experiment. 


EXAMPLE 15.10 — Calculate the mean squares for the set of three orthogonal 
comparisons just discussed. From table 11.14 the sum must be 306.24. 

EXAMPLE 15.11 — In example 11.10, suppose the ration fed to lot 2 were con- 
sidered standard. Make the comparisons: 


Lot 


Comparison 1 


1 

2 

3 


0 

0 


Sum 


2 


•f 

+ 

4 


3 

0 

0 


Mean 


4 


Square 


0 0.3618** 

0 0.0845 

1.0904** 


1.5367 


Here, the sum is greater than that for rations in the original analysis. 
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EXAMPLE 15.12 — There are some interesting comparisons in the sugar beet 
experiment of table 12.13. Six of the treatments fall into two groups differing only in 
respect of superphosphate. The treatment, P + N , , cannot be used because there is no 
contrasting treatment with N alone. That is the defect mentioned in example 12.10. 
The other six treatments give comparison 1. The potash and nitrogen comparisons 
lose one plot each for similar reasons. The fourth comparison, the interaction of P with 


Comparison 

Treatment 

Mean 

Square 

None 

p 

k 

pi 

up 

nk 

npk 

1. P 



4 

— 

4 

0 


4 

81.27** 

2. K 

— 

— 

4* 

4 

— 

0 

4 

1.80 

3. N 

0 

— 

— 

— 

4 

4 

4 

5.25 

4. PK 

1 

— 

— 

4 

0 

0 

0 

0.63 

5. NP 

0 

0 

4 

__ 

0 

— 

4 

1.19 


Ky must include only four treatments because there was none with Malone. In an inter- 
action, the plots receiving none and both treatments are compared with those having 
either one or the other. The interaction of N with P involves four treatments all in- 
cluding K. 

The comparisons are not orthogonal and the set is not complete. Perhaps you can 
find some others of interest. An experiment in which the three fertilizers are tried in all 
combinations will be outlined in section 15.9. 

EXAMPLE 15.13 — In the foregoing example, set up one of the interaction com- 
parisons in a 2 X 2 table and evaluate it by the usual computations of analysis of 
variance. 


15.5 — Comparisons not planned before the experiment is per- 
formed. It is not uncommon to plan variety or feeding trials with no 
standard variety or ration. Among the resulting n means there are n — 1 
independent comparisons that may be made, yet the means can be paired 
in n{n — l)/2 different ways. Unless a set of independent comparisons 
was incorporated when the design was decided upon there is a tendency 
to select from the larger number certain comparisons which may reflect 
no more than peculiarities in the data themselves. Now, careful examina- 
tion of results is the investigator’s duty: he should put the means into 
arrays, or segregate those which are alike from those which plainly differ; 
then interpret what he finds in the light of known facts. But he should 
avoid applying tests of significance to his arrangements because they are 
not random. 

It is necessary to distinguish sharply between experiments in which F 
is large and those whose means do not differ significantly. An instance of 
the former was discussed in section 10.4, but such pronounced groupings 
may be lacking. It is common to see an array of variety means having 
almost normal distribution. The test has indicated that they differ sig- 
nificantly, yet there is no way to learn which are from different popula- 
tions and which are from a common one. It is safe to infer that the high 
means are not from the same population as the low, and usually it is 
satisfactory to select an upper fraction for continued trial. Some find 
satisfaction in calculating the significant difference and comparing it with 
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the array, but apparently no connection is established with the 5% 
probability — this probability applies only to random pairs , whereas the 
significant difference extends over an interval in an arrayed sample. For 
such a test the probability is not known. The available information in 
this type of experiment consists of the means, which are unbiased esti- 
mates; the F-test of the hypothesis that these means are drawn from a 
common population; and the fiducial interval. This knowledge is sufficient 
to lead to decisions and to point toward new trials that should be made. 

If testing the null hypothesis has produced a nonsignificant F, it usually 
is not profitable to attempt interpretations of differences among the means. 
Certainly, any oddities should be reviewed and should perhaps be made 
the basis for future experimentation, but they cannot warrant exact 
statements about probability. Reexamination of your sampling in section 
3.2 will likely reveal striking arrangements of means. I selected these 6 
in successive samples: 

33.4, 26.4, 29.3, 30.9, 22.5, 29.3 

Analysis of variance of the 60 pig gains in these 6 lots resulted in the 
following mean squares: 

Means 5 142.7 

Individuals 54 111.7 F=1.28 

If this were the result of an actual experiment, the investigator might be 
tempted to infer a population difference corresponding to the two means, 
22.5 and 33.4 pounds. He might justify this by calculating, 

d = 10.9 lbs., si = v / 2(111.7)/10 = 4.73, t = 2.30, 

with probability about 0.03. But he would be forgetting that he is testing, 
not a random difference but the range of 6 random means. The proper 
test is that of table 5.5: 

ran ge/r/ = 2.30, P = 0.5, 

(or exactly, range/crj = 10.9/4.47 = 2.44). If the ratio, range/j^, were 
greater than the 5% point, 4.04, then in a real experiment there would 
be reason for careful survey of the circumstances, including this one: how 
did it come about that so pronounced a difference was not anticipated? 
Ordinarily, discovery of nonsignificance of F is the end-point of the statis- 
tical examination. 

EXAMPLE 15.14 — Examine the means in the fertilizer experiment of example 
11.28. They fall into three groups: A, BCD , and EFG. Calculate the fiducial half- 
interval, 19.4, and show that the population mean B might lie in either of the more 
extreme groups. What recommendations would you make? 

15.6 — Individual comparisons in regression. The method of fitting 
orthogonal polynomials, described in section 14.6, led to individual com- 
parisons associated with (i) linear regression, (ii) parabolic regression 
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fitted to deviations from linear, etc. Each mean square can be tested for 
significance independently. 

If the values of X are equally spaced, the mean squares for the suc- 
cessive comparisons are easily calculated from tabulated sets of coeffi- 
cients (8) (11). Those in table 15.1 are sufficient for our purposes. To 


TABLE 15.1 

Coefficients and Divisors for Sets of Orthogonal Comparisons in Regression. 
X Is Spaced at Equal Intervals. Read From Top to Bottom of Columns 


Number 

of 

Group 

Degree of Polynomial 

1 

1 

2 

1 

2 

3 

1 

2 

3 

4 

1 

2 

3 

4 5 

1 

— 

— 

+ 

— 3 

4- 

— 

— 2 

4-2 

— 

4- 

-5 

4-5 

— 5 

4“ — 

2 

+ 

0 

-2 

— 

— 

+ 3 

— 

— 

4-2 

— 4 

— 3 

— 

4-7 

-3 4-5 

3 


+ 

+ 

4- 

— 

-3 

0 

— 2 

0 

4-6 

— 

— 4 

4-4 

4-2 — 10 

4 




+ 3 

4- 

4- 

4- 

— 

-2 

— 4 

4- 

-4 

— 4 

4-24-10 

5 







4-2 

4-2 

4- 

4- 

4-3 

— 

— 7 

-3 -5 

6 











4-5 

4-5 

4*5 

4- 4- 

Divisor 

2 

2 

6 

20 

4 

20 

10 

14 

10 

70 

70 

84 

180 

28 252 


illustrate their use, let us recalculate the mean square for linear regression 
of yield of millet on spacing, section 11.7. Since there are five spacings 
(groups), and since we wish only the linear or first degree regression, we 
copy the sums from table 11.11 and the coefficients from table 15.1: 

— 2(1,349) -1(1,314) +0(1,262) +1(1,191) +2(1,188) 

That is, —4,012 + 3,567 — —445 is the difference for the numerator of 
the mean square. The sum of the squares of the coefficients is 10, and k~ 5; 
therefore, 


(-445) 2 
(10) (5) 


3,960.5 


is the mean square attributable to linear regression, the same result as the 
former except for rounding. 

Linear regression may not account for the major portion of the sum 
of squares. An illustration is found in the seasonal yields of oranges at 
Riverside. Since the varieties of section 11.16 were only dummies, the 
data are combined into annual totals: 

1924 1925 1926 1927 

2,313 2,041 1,622 2,487 

The sum of squares for these four groups may be divided into three parts 
explained by first, second, and third degree polynomial regressions. The 
coefficients and divisors are taken from the table ( k = 15): 

, [-3(2,313) -1(2,041) +1(1,622) +3(2,487)]’ 

x ' iiisj 35 
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[+1(2,313) -1(2,041) -1(1,622) +l(2,487)] s 
(4) (15) 

[—1(2,313) +3(2,041) -3(1,622) +1(2, 487)] 2 

( 20 ) ( 15 ) 

Total 


* 21,546 

* 6,826 
28,407 


It is plain that the linear trend is practically horizontal; that the devia- 
tions are fitted quite well by a parabola; that deviations from parabolic 
trend are significant, since F — 6,826/54, d.f. — 1 and 24; and that there 
is no deviation from the cubic. The third statement is based on either of 
two facts: (i) the total sum of squares is completely accounted for by the 
three regressions; or (ii) a cubic may be fitted to any 4 points. 


EXAMPLE 15.15 — Taking days 9-14, inclusive, from the chick embryo data of 
sections 14.2 and 14.6, divide the sum of squares of the six weights, 0.75541, into five 
portions, one corresponding to a polynomial of each degree through the fifth. 


EXAMPLE 15.16 — In the search for causes of lodging in oats, Brady (3) measured 
the thickness of the sclerenchyma cell wall in stems planted 4, 6, and 8 inches apart. 
Twenty-seven plots were planted at each spacing. The mean thicknesses of cell wall* 
were 4.34, 4.69, and 4.67 units. In the analysis of variance, the mean squares for spac- 
ing and error led to F = 1.0465/0.3482 = 3,01, not quite reaching the 5% point for 
d.f . = 2 and 56. (Three varieties each with three spacings were planted in a 9 X 9 
latin square, hence the 56 degrees of freedom for error.) Show that the regression of 
thickness on spacing is significant, F = 4.22, d.f. = 1 and 56. This is always an inter- 
esting possibility. Any one of a group of comparisons may attain significance even 
though the group as a whole falls short. 

EXAMPLE 15.17 — If you wish to duplicate the partitioning of the sum of square* 
of the chick embryo data by using the present method, you will need the coefficient* 
for eleven items. From the references cited, I have copied down those for the first four 
polynomials, together with the embryo dry weights. The answers are at the foot of 
each column: 


Dry Weight i 

Coefficients for Polynomial of Degree 

1 

2 

3 

4 

0 029 

— 5 

15 

-30 ' 

6 

0.052 

-4 

6 

6 

-6 

0.079 

-3 

-1 

22 

-6 

0 125 j 

-2 

- 6 

23 

-1 

0.181 

-1 

- 9 

14 

4 

0 261 

0 

-10 

0 

6 

0.425 

1 

- 9 

-14 

4 

0.738 

2 

- 6 

-23 

-1 

1.130 

3 

- 1 

-22 

-6 

1.882 

4 

6 

- 6 

-6 

2.812 

5 

15 

30 

6 

Divisor 

Sum of square* 

110 

6.0785 

858 

1.8432 

4,290 

0.2368 

286 

0.0060 
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15.7 — Applications to randomized blocks experiments. Among the 
many experiments conducted in randomized blocks, there are some for 
which the straightforward analysis of variance is not conclusive. One 
instance is the experiment contrasting control measures, the data being 
numbers of individuals occurring in the plots. Such data are often 
skewed instead of normal, or exhibit heterogeneous variance, or have 
means not independent of variance. This was discussed in section 11.17, 
and will be considered further in section 16.7, but one method of handling 
the situation may be given here. 

Some data on weed infestation in cereals, reported by Bartlett (2) and 
examined in detail by Cochran (6), arc rearranged by treatment in table 
15.2. While treatment differences arc obviously significant, the error 
variance is not suitable for making critical tests — it is too small for the 
highly variable treatments, A and B , and too large for the others. This 
may be inferred from the ranges in the last line of the table. 

TABLE 15.2 

Number of Poppy Plants in Oats 
Plants per square feet. 


Treatment 


Block 

A 

B 

C 

D 

E 

F 

1 

438 

538 

77 

17 

18 

115 

2 

442 

422 

61 

31 

26 

57 

3 

319 

377 

157 

87 

77 

100 

4 

380 

315 

52 

16 

20 

45 

Sum 

1,579 

1,652 

347 

151 

141 

317 

Range 

123 

223 

105 

71 

59 

70 

Sums of squares: Treatment, 641,023; Error, 39,801. Mean square for error, 2,653. 


The nature of the treatments is not reported, except that A was “no 
treatment.” The data clearly form three pairs, and we shall assume that 
this reflects some relationship in the character of the treatments. The 
method used is based on the fact that the testing of paired data is little 
affected by heterogeneity of variance. 

The following comparisons are indicated in the data: 



A 

B 

C 

D 

E 

F 

1. A vs, B 


+ 

0 

0 

0 

0 

2. C vs. F 

0 

0 

4- 

0 

0 

— 

3. D vs. E 

0 

0 

0 

+ 

— 

0 

4. C -j- F vs. D + E 

0 

0 

4- 


— 

4* 

5. A 4* B vs. others 

2 

2 

— 

— 

— 



These five orthogonal comparisons are applied to the data in each block 
with the results given in table 15.3. As examples: (i) for comparison 1 in 
block 1, — A + B = —438 + 538 = 100; (ii) for comparison 4 in block 
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TABLE 15.3 

Differences for 5 Orthogonal Comparisons on Each Block, and Mean Squares 
for Testing Significance 





Comparison 



Block 

1 

2 

3 

4 

5 

1 

100 

-38 

-1 

157 

1,725 

2 

—20 

4 

5 

61 

1,553 

3 ! 

58 

57 

10 

93 

971 

4 

-65 

7 

-4 

61 

1,257 

Total 

73 

30 

10 

372 

5,506 

Divisor 

2 

2 

2 

4 

12 

Mean square: 
Difference 

666 

112 

12 

8,649 

512 

631,584 

Error 

2,776 

756 

20 

9,203 


4, +C — D — E + F = 52 — 16 — 20 + 45 = 61; (iii) for comparison 
5 in block \>2A + 2B — C— D — E—F~ 876 + 1,076 - 77 - 17 
- 18 - 115 = 1,725. 

In each comparison the total is the same as if the difference had been 
calculated from the sums in table 15.2. 

The mean square for the comparison difference is calculated from the 
total as usual. Thus, for comparison 4, 


(372) 2 

(4X4) 


8,649 


For error, the corrected sum of squares is calculated from the block 
differences, then divided by both the indicated divisor and the 3 degrees 
of freedom for 4 blocks: k = 1. The steps are, for comparison 1: 


1. £v 2 = (100) 2 + . . . + (-65) 2 - (73) 2 /4 - 16,657 

2. Sum of squares = 16,657/2 = 8,328 

3. Mean square = 8,328/3 = 2,776 


A final note on the computation: the five mean squares for treatment 
differences, each with 1 c//., add to 641,023, the same as the sum of squares 
in table 15.2; also, the error mean squares, each the result of dividing a 
sum of squares by 3 dj ., add to 13,267, which is 1/3 of the original sum of 
squares for error as it should be. This verifies both the calculations and the 
orthogonality of the comparisons. 

Anticipated differences among the error mean squares turn out to be 
highly significant (section 10.13), showing that the average error, 2,653, 
of table 15.2 was unsuited for any comparison except the first. Since 
it is assumed that comparison 4, 

C + F - (D + E) = 372 
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reflects a feature of the design (not merely a grouping of fortuitous data), 
it is to be noted that this difference is highly significant when tested against 
its correct error, whereas it would have appeared nonsignificant if com- 
pared to the average error. Quoting Cochran: “This example illustrates 
the most important condition for the application of a combined analysis 


TABLE 15.4 

Weight (ounces) of Asparagus Cut Before June 1 From Plots With 
Various Cutting Treatments 


Block 

Year 

Cutting Ceased 

Total 

Treatment Components 

June 1 June 15 July 1 July 15 

Linear Quadratic 
T l T q 

Cubic 

T c 

1 

1930 

230 

212 

183 

148 

773 

-275 

- 17 

- 5 


1931 

324 

415 

320 

246 

1,305 

-329 

-165 

-207 


1932 

512 

584 

456 

304 

1,856 

-752 

-224 

-176 


1933 

399 

386 

255 

144 

1,184 

-896 

- 98 

-138 



1,465 

1,597 

1,214 

842 


-2,252 

-504 

-526 

2 

1930 

216 

190 

186 

126 

718 i 

-274 

- 34 

4- 78 


1931 

317 

296 

295 

201 

1,109 ; 

-349 

- 73 

4-113 


1932 

448 

471 

387 

289 

1,595 1 

-561 

-121 

- 93 


1933 

361 

280 

187 

83 

911 

-927 

- 23 

- 1 



1,342 

1,237 

1,055 

699 

i 

i 

-2,111 

-251 

4- 97 

3 

1930 

219 

151 

177 

107 

654 

-310 

- 2 

+ 190 


1931 

357 

278 

298 

192 

1,125 

-475 

- 27 

+225 


1932 

496 

399 

427 

271 

1,593 

-647 

- 59 

+309 


1933 

344 

254 

239 

90 

927 

- Ill 

- 59 

+209 



1,416 

1,082 

1,141 

660 


-2,209 

-147 

+933 

4 

1930 

200 

150 

209 

168 

727 

- 37 

4- 9 

+209 


1931 

362 

336 

328 

226 

1,252 

-416 

- 76 

+ 112 


1932 

540 

485 

462 

312 

1,799 

-707 

- 95 

+159 


1933 

381 

279 

244 

168 

1,072 

-674 

+ 26 

+ 108 



1,483 

1,250 

1,243 

874 


-1,834 

-136 

+ 588 

Total 

j 

5,706 

5,166 

4,653 

3,075 

18,600 

-8,406 

— 1 ,038 

1,092 


Preliminary Analysis of Variance 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Blocks 

3 

30,170 

10,057 

Years 

3 

518,722 

172,907 

BY 

9 

5,092 

566 

Treatments 

3 

241,377 

80,459 

TB 

9 

21,861 

2,429 

TY 

9 

51,178 

5,686 

TBY 

27 

10,395 

385 
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of variance: the treatment comparisons must be such that they may all be reasonab- 
ly expected to have the same variance” 

Other applications of individual comparisons are useful if treatments 
at several levels are tried in randomized blocks, or if trends in the yields 
of a perennial crop are the focus of interest. Both occur in an experiment 
by Haber (10) to compare the effects of various cutting treatments on 
asparagus. Planting was in 1927 and cutting began in 1929. One plot 
in each block was cut till June 1, others to June 15, July 1, and July 15. 
The yields in table 15.4 are the weights cut to June 1 in every plot, irre- 
spective of later cuttings in some of them. This weight, then, is a measure 
of vigor, and the experiment is to determine the trend through 4 years. 

The total yields throughout the 4 years, tabulated in a 4 X 4 treatment- 
block table, give lines 1, 4, and 5 in the analysis of variance. As in section 
11.16, the treatment-block interaction is the appropriate error for testing 
treatments, so that the means and mean squares for this 4X4 table might 
be the end-point of the investigation. 

But we wish to examine the trends in yield, not only those caused by 
increasing severity of cutting, but those that may be associated with loss of 
vigor through the years. To this end, the linear regressions of yield on 
treatment are calculated, not only for the totals, but also for every block 
and year separately. On the assumption of equal intervals between cut- 
ting dates, the coefficients for 4 groups (table 15.1) are available. Thus, 
the linear component for block 1 in 1930 is 

— 3(230) - (212) + (183) + 3(148) = -275 

That is, Sxy — —275 if x is taken as —3, —1, 1, and 3. The regression 
coefficient is b — Sxy/Sx 2 = —275/20 = —13.75 ounces per unit of*. 
(Note: there arc 2 units of x for each treatment interval.) 

The regression of yield is uniformly downward (negative). Moreover, 
examination of the linear components indicates that this trend becomes 
more pronounced as the age of the plants increases. Perhaps these features 
will be more evident if the linear components, 7* L , are tabulated in the 
4X4 treatment-block table 15.5. The totals in the last line emphasize 
the steepening downward slope as the years pass: the decrease in vigor 
with increasing severity of cutting is accentuated as the plants grow older. 

Keep in mind that all the entries in table 15.5 are sums of products, 
Sxy. The total, —8,406, could be got directly from the treatment yields 
(table 15.4): 

-3(5,706) - (5,166) + (4,653) + 3(3,075) « -8,406 

The average, ( — 8,406)/16 = —525.4, leads to the regression of yield 
on treatment, b == Sxy/Sx 2 = —525.4/20 = —26.27 ounces per unit 
x = —52.54 ounces per cutting interval (15 days). The associated reduc- 
tion in sum of squares is indicated in the table. The corresponding sum 
of squares for blocks, calculated as explained earlier in this section, results 
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TABLE 15.5 

Linear Component*, Tl, of Regressions, Yield on Date of Cutting 


Block 

Year of Harvest 

Total 

1930 

1931 

1932 

1933 

1 

-275 

-329 

-752 

-896 

-2,252 

2 

-274 

-349 

-561 

-927 

-2,111 

3 

-310 

-475 

-647 

-777 

-2,209 

4 

- 37 

-416 

-707 

-674 

-1,834 

Total 

-896 

-1,569 

-2,667 

-3,274 

-8,406 


(-8,406)* _ 

Tl • (20) (16) 220 > 815 


^ (-2.252P + . . . + (-1,834)» 

TlB . (20)(4) Tl ' 1)323 

^ v . (-896)2 + . . . + (-3,274)2 

r L Y -. (261(4) n " 42)892 

T t fiF: - r t ) - Ti.fi - TiF - 4,756 


Analysis of Variance of Linear Components of Treatments, 7 /, 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

T l 

1 

220,815 

T l B (Error) 

3 

1,323 

Total 

4 

222,138 


Mean 

Square 

220,815 

441 


in the error mean square, 441, against which T L may be tested. Two 
points are worthy of notice: most of the treatment sum of squares, 241,377 
(table 15.4) is due to linear regression; and the mean square, T h B — 441, 
is greatly reduced from TB — 2,429. The small value of T L B leads one 
to observe the uniformity of the block totals in table 15.5 and to look for 
the larger sources of variation in TB. 

Notes on computation: (i) The correction term for T L B turns out to 
be T l . (ii) Since the coefficients, —3, —1, +1, +3, are used to get all 
the entries in table 15.5, the sum of their squares, 20, appears as divisor 
in each fraction, (iii) The other factor in the denominator is, as usual 
in such a tabic, the number of items in the sum which is squared in the 
numerator. 

The treatment-year interaction in the preliminary analysis of variance 
is highly significant, F — 5,686/385 = 14.77. The linear component of 
treatment accounts for a large part of this interaction, as shown by com- 
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parison of T L Y = 42,892 with TY = 51,178. We wish to segregate a 
portion of TBY suitable for testing T L Y, and to learn if the remainder of 
TY is homogeneous. 

Pursuing the objectives mentioned in the first and third paragraphs 
preceding, table 15.6 is constructed by applying the coefficients for 4 


TABLE 15.6 

Regression of Linear Components on Years, TlY 


Block 

Linear, TlYl 

Quadratic, TlYq 

Cubic, TlYc 

1 

-2,286 

- 90 

+648 

2 

-2,171 

-291 

- 17 

3 

-1,573 

4- 35 

4- 49 

4 

-2,202 

4-412 

+236 

Total 

-8,232 

4~ 66 

+916 


groups (table 15.1) to the linear components in table 15.5. As examples 
in block 1 : 


T l Y l ~ — 3( — 275) - (-329) + (-752) + 3(-896) - -2,286 
T l Y q = ( — 275) - (-329) - (-752) + (-896) = - 90 

T l Y c = - (-275) + 3( — 329) -3(-752) + (-896) = 648 


The entries in this table indicate the regressions on years (the trends) of 
the linear components of treatment regressions. Thus, the average, 
— 8,232/4 = —2,058, gives the linear regression coefficient, —2,058/20 
= — 102.9 ounces per half year. The sum of squares for this regression is 


T l Y l : 


( — 8,232) 2 
(20) (20) (4) 


= 42,354 


The denominator contains a factor, 20, for each application of the coeffi- 
cients and 4 because the numerator is the sum of 4 values of T L Y L , Since 
this sum of squares contains most of T L Y = 42,892, it is unnecessary to 
look further for sources of variation. Nevertheless, to illustrate the 
method, we calculate the sum of squares of the deviations from linear 
regression, 

( 66) 2 _ ^ 

lF °’ (20) (4) (4) 

The divisors, 20 and 4, are the same as those in T L Y L , but the sum of the 
squares of the quadratic coefficients is 4 instead of 20. Finally, the sum of 
squares of the deviations from second degree regression is the cubic com- 
ponent, 

(916) 2 

TlYc '- (20) (20) (4) = 524 
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TABLE 15.7 

Analysis op Variance of Components of the Treatment Effects — Asparagus data 


Source of 

Degrees of 



Variation 

Freedom 

Sum of Squares 

Mean Square 


Linear Components, 7T 


Tl 

1 

220,815 

220,815 

t l b 

3 

1,323 

441 

t l y l 

1 

42,354 

42,354 

t l y q 

1 

14 

14 

T l Y c 

1 

524 

524 

t l by l 

3 

802 

267 

t l by q 

3 

3,283 

1,094 

TvBYc 

3 

671 

224 

Sum 

16 

269,786 



Quadratic Components, Tq 


Tq 

1 

16,835 

16,835 

TqB 

3 

5,485 

1,828 

TqY l 

1 

744 

744 

TqYq 

1 

6,440 

6,440 

TqYc 

1 

414 

414 

T q BY l 

3 

927 

309 

TqBYq 

3 

2,117 

706 

TqBYc 

3 

89 

30 

Sum 

16 

33,051 



Cubic Components, Tc 


To 

1 

3,726 

3,726 

T C B 

3 

15,053 

5,018 

T c Y l 

1 

536 

536 

T c Yq 

1 

135 

135 

T c Yc 

1 

16 

16 

T c BY l 

3 

507 

169 

T c BYq 

3 

880 

293 

T c BYc 

3 

1,120 

373 

Sum 

16 

21,973 


Total 

48 

324,810 



One of the divisors, 20, is the same as before, but the other is the sum 
of the squares of the coefficients of the third degree regression, ( — l) 2 
+ (3) 2 + ( — 3) 2 + (1) = 20. (If there were 5 years instead of 4, this 
would be 10; see table 15.1). We now verify the orthogonality of T L y L , 
T L Y Q , and T l Y c by addition of the sums of squares: 

42,354 + 14 + 524 = 42,892 
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For testing the three mean squares just isolated, there are correspond- 
ing components of TifiY . These are calculated from the columns of 
table 15.6 in the usual manner; for instance, in the middle column, 


T l BY q : 


(-90) 2 +... + (412) 2 
(20) (4) 


~t l y q 


3,283 


The sum of this and the other two components, 802 and 671, is T L BY 
= 4,756. 

Summarized in the first part of table 15.7 are the computations de- 
scribed above. Among the regressions of T L on years, only the linear 
component is significant. Since the 3 components of T L BY do not differ 
significantly (Bartlett’s test, section 10.13, gives x 2 — 2.12, d.f. = 2), it is 
right to use the mean square, 4,756/9 = 528, to test all the components 
of T L Y. 

Some heterogeneity remains to be located, so we proceed to isolate the 
quadratic and cubic components of T. The procedure follows closely 
that already described, only the coefficients and corresponding divisors 
being changed. For 


T q : 


( — 1,038) 2 
(4) (16) 


16,835 


This is the reduction in sum of squares due to fitting a second degree 
polynomial (parabola) to the deviations from linear regression. For 


. (1,092) 

* (20) (16) 


Verification: T L + T Q + T c = 220,815 + 16,835 + 3,726 = 241,376= T. 

The corresponding components of TB , T Q Y> and T Q BY are calculated 
from a table like 15.5 except that its entries are the quadratic components, 
Tq. A third table with T c yields T C B , T C Y , and T C BY. From each of 
these tables is made one like 15.6 and from them the computations are 
completed (table 15.7). The results are all recorded for the guidance 
of the student though only a few of them indicate more than sampling 
variation. We shall look at these few and condense the table into a form 
which would be suitable for publication — table 15.8. 

First, look at the components of TB (mean squares/: 


T l B = 441 

T q B = 1,828 
T C B = 5,018 


Despite the discrepancies, the components do not differ significantly, 
chi-square being at about the 20% point. If you plot the four treatment 
means in each block, you will observe the following: (i) The linear regres- 
sions of yield on treatment are quite uniform, especially in the first three 
blocks (observe the sums of products in column T h of table 15.4, —2,252, 
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TABLE 15.8 

Final Analysis of Variance of the Asparagus Data 


Source of Variation 

Degrees of Freedom 

Mean Square 

Blocks 

3 

10,057 

Years 

3 

172,907 

BY 

Treatments: 

9 

566 

Linear, 7L 

1 

220,815 

16,835 

Quadratic, Tq 

1 

Error for treatments, TB 
Trends: 

9 

2,429 

Linear, T l Yl 

1 

42,354 

Quadratic, TqYq 

1 

6,440 

Error for trends, TBY 

27 

385 

Homogeneous effects 

8 


Total 

63 



etc.), hence the small T L B. (ii) The parabolic deviations are noticeable 
in block 1, but decrease in the other blocks. Treatment 1 is uniformly 
high in all blocks and treatment 4 is low, but the relative positions of the 
intermediate treatments give rise to the variations in T Q B and T C B . 
Cutting until June 15 did not affect the vigor of the plants in block 1, 
but in block 3 this treatment was more severe than the cutting until 
July 1. Apparently, all this can be attributed to the available nutrients in 
the plots, (iii) The notable differences in the cubic components, from 
— 526 in block 1 to +933 in block 3, are due chiefly to the behavior of 
plots 2 and 3. Since there is a tendency toward uniformity from year to 
year within the blocks, this also seems traceable to differential plot 
fertility. 

The data suggest that the relation between vigor and date of final 
cutting is critically associated with fertility. But fertility was not con- 
trolled in this experiment; hence, the appropriate error for treatments 
should include ail components of TB. This reasoning, added to the non- 
significance of the differences among the components, leads us to use the 
pooled mean square, 2,429, as error for all treatment components. 

As for the nine components of the second order interaction, they are 
notably homogeneous, chi-square being near the 40% point. The error 
for all first order interactions, then, is taken as TBY = 385. 

The obvious linear decrease in vigor with prolongation of date of 
final cutting, T Li and its equally obvious downward trend, T L Y L , have 
been commented on. There remain only two effects to be discussed. 

1 . The deviations of the treatment means from their linear regression 
show the increasingly detrimental effects of prolonged cutting. The 
means, from the row of totals in table 15.4, are 356.6, 322.9, 290.8, and 
192.2. The successive differences measure the departure from linearity; 
33.7, 32. and 98.6. The augmented decrease in yield for the July 15 
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cutting is a feature of all blocks except the first. T Q = 16,835, a signifi- 
cant variance, is the average square of that portion of the deviations from 
linearity explained by parabolic regression. A graph of the treatment 
means makes this feature more vivid. 

2. The highly significant T Q Y Q = 6,440 results from a parabolic trend 
in the quadratic components of the treatment regressions. To see this, 
accumulate T Q by years: 

1930 1931 1932 1933 

— 44 -341 -499 -154 

These quadratic portions of the deviations of the treatment yields from 
linear regression, T Q , do not exhibit much linear trend, but do show pro- 
nounced deviations which are well described as parabolic. Straight lines 
fitted the treatment yields fairly well in 1930 and 1933, but in the two 
intermediate years the June 15 and July 1 dates had yields well above 
the linear regressions. The interpretation is that in the more favorable 
seasons of 1931 and 1932 the effects of the June 15 and July 1 cuttings 
were relatively less harmful than in the years of low yield, 1930 (the plants 
were young, too) and 1933. 

EXAMPLE 15.18 — Notice the total sum of squares for 48 degrees of freedom in 
table 15.7. Of what parts of the analysis of variance in tabic 15.4 is this a re-analysis? 

EXAMPLE 15.19 — Verify and fix in mind the meaning of an equation like the 
following, the terms being sums of squares: 

T q Y l -f T q Yq + TqY c - T q Y 

EXAMPLE 15 20 — For each year, calculate the regression of yield on treatment 
date, using as a unit 15 days* 1 corresponds to June 1, 2 to June 15, etc. For 1930, 
Y == 179.5 — 22.4Q(X — 2 5). Plot each regression together with the treatment means. 
Verify the explanation of TqYq . 

EXAMPLE 15.21 — To the treatment yields for 1932 fit linear, quadratic and cubic 
regressions as in section 14.6. 

15.8 — Switch-back or reversal experiments. Brandt (4) has shown 
that individual comparisons are convenient for evaluating the results of 
those experiments where two treatments are tried alternately on two groups 
of animals. Such are especially common in investigations of the effect of 
treatment on milk flow. An example will serve to explain the situation. 

The milk yields in table 15.9 are data from an experiment (1) to de- 
termine the value of a gluten supplement to the grain mixture in feeding 
dairy cows. In order to obviate the effects of the normal decline in milk 
flow, the feeds were tried alternately in three periods of 35 days each. 
As indicated in the table, lot A received the grain and gluten ( GG ) during 
the first period, while lot B had only grain ( G ). In the second period these 
parts of the rations were interchanged. 

The comparisons are by period. Hence, for each animal, 

a 0 —c 
a —2 b +c 


and 



422 


Statistical Methods 


Section 15,8 


TABLE 15.9 

Pounds of Milk Produced by Two Lots of Holstein-Friesians During Each of 
Three Periods. Two Rations, One Containing Grain (G), the Other Grain and 

a Gluten Feed (GG) 


: 

Lot 

Cow 

Number 

Period 

Comparison 
a — 2b + c 

Sum 

a 

b 

c 



(GG) 

(G) 

(GG) 



A 

1 

671 

610 

597 

48 



2 

615 

555 

488 

- 7 



3 

111 

733 

694 

5 



4 

1,102 

959 

940 

124 



5 

764 

718 

717 

45 

215 



(G) 

(GG) 

(G) 



B 

6 

433 

414 

363 

-32 



7 

745 

797 

780 

-69 



8 

858 

754 

680 

30 



9 

977 

1,025 

1,007 

-66 



10 

655 

616 

495 

-82 

-219 


give complete information. The first comparison measures the obvious 
decrease in milk flow, not part of the object of this experiment. The 
second, however, gives the deviation of the milk yield in the middle 
period as measured from the linear regression. Its value for each cow is 
recorded in the table, sums for each lot appearing in the last column. For 
lot A these sums are ( GG ) — (£), while for lot B they are the negatives, 
( G ) — (GG). The difference, therefore, 

215 - (-219) = 434, 


represents the total pounds of milk attributable to the gluten feed, assum- 
ing no carry-over from the preceding period. Brandt showed that this 
difference is properly tested for significance by applying the methods of 
sections 4.2 and 15.2. The error variance comes from the pooling of the 
sums of squares in the two lots of comparisons. In lot A , for example, 

£*2 = (48)2 + ( — 7) 2 + (5) 2 + (124)2 + (45)2 _ (215) 2 /5 « 10,534 


The corresponding sum for lot B is 8,173. The error variance is, then, 


10,534 + 8,173 
4 + 4 


2,338 


For the comparison of gluten and nongluten supplements, the sum of 
squares is, 


[215 - ( — 21 9)] 2 
2(5) 


18,836 


Hence, F — 18,836/2,338 = 8.06, significant for d.f. = 1 and 8. 

This ingenious method of testing significance in reversal trials may be 
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extended to any number of trial periods. If there were four periods, for 
example, the comparison for third degree polynomial, table 15.1, would 
be used. 

EXAMPLE 15.22 — In table 15.9, run up the sums in the six groups of milk yields 
and represent them graphically. In lot A , the sum in period b falls below the line con- 
necting the sums in a and c. The opposite condition is found in lot B. 

EXAMPLE 15.23 — Cannon, Hansen, and O’Neil (5) tried the effect which two 
methods of watering dairy cows, inside the barn (/) and outside (0), might have on 
butterfat production. The pounds of butterfat yielded by ten cows in two lots at each 
of four periods are tabulated : 



Period 

Lot 

a 

b 


c 

d 

A 

u) 

(0) 


a) 

(0) 


40 

32 


28 

14 


27 

19 


20 

14 


35 

27 


31 

25 


30 

28 


27 

26 

■ 

37 

30 


32 

27 

B 

0 0 ) 

(/) 


(O) 

(/) 


51 

55 


40 

40 


32 

32 


32 

28 


36 

38 


29 

31 


32 

30 


25 

25 


21 

22 


15 

15 


Compute for each cow the comparison for four periods (table 15.1), 

-f 36 - 3c -f d 

The sums are for lot A, —69, and for lot B, 75. Calculate F ** 2,074/132.6 = 15.6, 
highly significant for d.j. = 1 and 8. 


15.9 — Factorial experiments. A comprehensive and efficient kind of 
experiment, one that will probably increase in popularity, is the factorial 
(7) (12), in which two or more series of treatments are tried in all com- 
binations. For example, three sources of protein might be used in a feed- 
ing experiment, each at two levels. The six combinations of this pair of 
series, source (s) and level (/), are 

5111 S2I1 S3I1 

5112 S2I2 SzU 

Each combination might be applied to a lot of, say, 10 animals, with the 
resulting analysis of degrees of freedom, 

Treatment combinations, 5 
Error 54 

Information about the effect of level is furnished by every animal, because 
30 receive the low level and 30 the high. The comparison of levels corre- 
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spends with one of the five degrees of freedom for treatments. Further- 
more, the source of protein might be selected for two orthogonal com- 
parisons, such as 

— Si + Sz 

4 * — 2st 

The remaining two degrees of freedom correspond to the interactions in 
the 2X3 table. 

One form of the factorial experiment is that illustrated in table 15.10 
in which each series of treatments is at two levels; in this case, none and 

TABLE 15.10 

Yields of Wheat in a 2 s Factorial Experiment Laid Down in a Latin Square. 
Nitrogen (/i), Phosphorous ( p) y and Potassium ( k ) Were Tried at Two Levels, 
None and Some. (1) Indicates No Fertilizer 


p 

n 

np 

k 

nk 

a) 

npk 

pk 

18.8 

12.2 

18.3 

15.8 

11.4 

11 5 

19 4 

18 9 

n 

nk 

pk 

npk 

P 

k 

np 

a) 

12.9 

7.3 

17.4 

17.2 

19.7 

12.0 

19 0 

15 6 

nk 

np 

n 

P 

a) 

npk 

pk 

k 

10.7 

17.5 

10.4 

18.0 

9 8 

16 6 

17.5 

14 3 

pk 

k 

npk 

a) 

n 

np 

P 

nk 

18.3 

12.6 

14.2 

12.2 

11.4 

14.5 

16.9 

16.1 

np 

(1) 

nk 

n 

pk 

P 

k 

npk 

17.9 

12.8 

13.3 

11 3 

16 5 

15 6 

10.9 

16.7 

k 

pk 

(1) 

np 

npk 

15.8 

n 

nk 

P 

14.9 

18.2 

12 8 

17.1 

9 5 

8 9 

20.6 

npk 

P 

k 

pk 

np 

nk 

0) 

n 

19.0 

18.9 

11.2 

17.1 

17.9 

8.6 

10.2 

14.5 

(i) 

npk 

P 

nk 

k 

pk 

n 

np 

17.5 

20.4 

20.8 

16.4 

16.8 

18 5 

13.6 

23.0 


Treatment Sums 


(1) 102 4 

p 149.3 

np 145.2 

pk 142 4 

n 95.8 

k 108.5 

nk 92.7 

npk 139.3 

Preliminary Analysis of Variance 


Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Treatments 

7 

513.79 

73.4 

Error 

42 


2.190 
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some. The three series — nitrogen, phosphorous, and potassium fertilizers — 
make 2* = 8 combinations, 


nopoks 

nopoki 

nopiko 

nop iki 

nipoko 

nipoki 

nipiko 

nipiki 


The symbols for the combinations are usually abbreviated, as shown in 
the table, the check (control or no treatment) being represented by (1). 

Factorial experiments may be laid out in any convenient design, such 
as randomized blocks; in the one illustrated, Goulden (9) used the 8X8 
latin square. The experimental error, 2.190, d.f . = 42, is available for 
testing each of the 7 treatment comparisons. Sometimes there is no repli- 
cation, the assumption being that one or more of the higher order inter- 
actions contains only the random variation suitable for estimating error 
(see section 11.14). 

The treatment sums, shown in the table, may be compared as follows: 


Effects of phosphorous: 






acting alone 

P ~ 

(0- 

149.3 - 

- 102.4 = 

46.9 

with nitrogen, 

fin 

— n = 

145.2 - 

- 95.8 = 

49.4 

with potash, 

pk 

- k = 

142.4 - 

- 108.5 = 

33.9 

with both, 

pnk - 

- nk = 

139.3 - 

- 92.7 = 

46.6 


Total 176.8 


It is evident that phosphorous always increased the yield of wheat on 
these plots. Since there were 32 plots at each level, the average increase 
per pair was 176.8/32 = 5.525. This mean difference has the standard 
error, 

Si = v / 2(2.190)/32 = 0.3700, 

so that t = 5.525/0.3700 = 14.93 leaves no doubt about the effectiveness 
of phosphorous under the conditions of this experiment. 

But, you may object, the phosphorous comparison has been made in 
four different circumstances so that the average effect may not be fully 
informative. The objection is valid if the mean differences (46.9/8, etc.), 
5.86, 6.18, 4.24, and 5.82, reflect the presence of the other fertilizers. As a 
preliminary test, compare the range, 6.18 — 4.24 = 1.94, with its 5% 
level, table 5.5. Since sj = \/2(2.190)/8 = 0.740, the 5% level of range 
is approximated by (0.740) (3. 56) = 2.70; hence, the mean differences 
may be random samples from a common population. Some more dis- 
criminating tests will now be discussed. 

Begin by looking at the difference between the effects of phosphorous 
(i) acting alone and (ii) in the presence of nitrogen, 49.4 — 46.9 = 2.5. 
We have noticed this kind of difference before, and have called it inter- 
action . Continuing, we have, 
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Interactions of phosphorous: 

with nitrogen alone 49,4 — 46.9 = 2.5 

with nitrogen in the presence of potash 46.6 — 33.9 = 12.7 
Total UJ 

Assuming again that these interactions differ only as in sampling, the 
average interaction is 15.2/32 = 0.475, so that t = 0.475/0.3700 = 1.28, 
nonsignificant. 

But perhaps the two interactions differ significantly. As we have had 
occasion to note before (section 11.14), the difference between two inter- 
actions is called second order interaction , the interaction among the three 
factors, phosphorous, nitrogen and potash. The average difference is only 
(12.7 - 2.5J/32 = 0.319; hence, t = 0.319/0.370 = 0.86 falls short of 
significance. 

We have now learned that phosphorous uniformly increases yield under 
the circumstances of this experiment, and that its action may be unaffected 
by the addition of nitrogen or potash. 

The effects of nitrogen may be examined in the same way, but a more 
compact scheme (table 15.11) is adequate after you have become ac- 
customed to the reasoning involved. Each difference, recorded under JV, 
is the effect of nitrogen alone or in the presence of k, or pk. Since the 


TABLE 15.11 

Effects of Nitrogen in Wheat Experiment 


Treatment 

Sum of Yields 

N 

NP 

NPK 

0) 

102 4 

- 6 6 

2.5 

10.2 

n 

95.8 




P 

149 3 

- 4.1 



np 

145 2 




k 

108.5 

— 15 8 

12.7 


nk 

92.7 




pk 

142.4 

- 3.1 



npk 

139.3 




Total 


-29.6 

15.2 

10 2 


differences do not vary significantly, they are combined in the lower line 
of the table. The totals represent the effects of nitrogen, N\ the inter- 
action, NP\ and the second order interaction NPK: the latter two are 
identical with the interactions in the phosphorous series. Notice that 
the lower case letters represent the treatments, while the capitals indicate 
effects or differences. 

Instead of dividing each sum by 32 and computing t , it is easier to 
calculate the significant difference between sums , using /.os = 2.018: 


2.018V (32)(2)(2.190) = 23.89 
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Comparison of this with N shows that the nitrogen effect is a bit larger 
than the 5% level of significance: the addition of nitrogen may be ex- 
pected to depress yields in the population of which this sample is repre- 
sentative. 

The test of NPK is the test of the significance of the difference between 
the two components of NP, the interaction of nitrogen and phosphorous 
(i) alone, and (ii) in the presence of potash. If NPK is evidently a popula- 
tion characteristic the investigator is warned that the nitrogen-phos- 
phorous interaction is different with potash and without, so that the aver- 
age effect, NP, may not be informative. He may make one recommenda- 
tion about n and p if k is to be used and a different one if A: is not; or he 
may have to warn the farmer, for example, that n and p should be used 
together only if A; is added. 

If NPK can be assumed negligible in the population; that is, if the two 
parts of NP are regarded as estimates of the same population difference, 
then NP is a homogeneous estimate of the difference between the effects 
of nitrogen, (i) with phosphorous and (ii) without, irrespective of the 
presence or absence of potash. In making this statement, the effect of 
potash is ignored — it may be great or it may be, as in the present experi- 
ment, negligible. The statement implies that, for evaluating the inter- 
action, NP , the upper and lower halves of the table may be superimposed. 
If NP is negligible, as here, the recommendation about nitrogen can be 
made without reference to phosphorous; but if NP were large, recom- 
mendations about the two fertilizers would have to be related. In other 
words, in the first case the average effect of N is the appropriate estimate; 
whereas, in the second case the average effect may not be interesting — 
the effect with phosphorous would be different from that without. 

Three tables like 15.11 are necessary to isolate the seven othogonal 
comparisons among the eight treatments. The results are summarized 
in table 15.12. Aside from N and P, none of the effects are significant. 
Note: three tables like 15.11 will give all the effects if the order of the 
treatments is properly selected. After (1 )-«-/> try (1 )-/>-&, then (1 )-k-n. 

Referring again to table 15.11, it will be evident that N is the differ- 


TABLE 15.12 

Seven Effects Among the Eight Treatment Combinations 
in the Wheat Experiment 


Effect 

Difference 

Effect 

Difference 


N 

-29.6 

NK 

- 8.2 


P 

176.8 

PK 

-15 8 


K 

- 9.8 

NPK 

10.2 


NP 

15.2 





5% significant difference, 23.89 
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cnee between the yields of the 32 plots receiving nitrogen and the 32 
without it: 

(95.8+145.2+92.7 + 139.3) — (102.4+149.3+108.5+142.4) = -29.6 

Again, NP is the difference between the yields of the 32 plots having 
both nitrogen and phosphorous or neither and of the 32 having either nitrogen or 
phosphorous but not both: 

(102.4+145.2+108.5 + 139.3) - (95.8 + 149.3+92.7+142.4) = 15.2 

Finally, NPK is the difference between the yields of the 32 plots having 
either one or three of the fertilizers applied as against the 32 plots having 
either none or two : 

(95.8+149.3 + 108.5 + 139.3) - (102.4+145.2 + 92.7 + 142.4) = 10.2 

From such rules, all the effects may be calculated without tables such as 
15.11, but I do not advise it except as verification — some of the important 
features of the experiment may be overlooked. 

The foregoing formulas are summarized in table 15.13. For the main 
effects , TV, P, and K, mark with + all combinations having the fertilizer 
and with — those lacking it. The signs for the interactions are got by 
applying the rule for multiplication, like signs give +, unlike — . Thus: 

N: -+--++-+ 

K : + - + + + 

NK : +- + -- + - + 

Notice the rhythm in each line: it will help prevent mistakes. Verify the 
signs in the interactions by comparing them with the rules preceding. 

The difference for each effect leads to a mean square in the manner 


TABLE 15.13 

Coefficients of Individual Comparisons Summarizing Formulas 
for the Treatment Effects 


Treatments 

a) 

n 

P 

k 

np 

nk 

pk 

npk 

Differ- 

ences 

Sum of Yields 

102 4 

95 8 

149 3 

108 5 

145 2 

92 7 

142 4 

139 3 

Comparisons: 










N 

— * 


— 

— 

4- 

-f 

— 

+ 

-29.6 

P 

— 

— 

+ 


+ 

— 

4~ 

+ 

176.8 

K 

— 

— 

— 

+ 

— 

+ 

+ 

+ 

- 9 8 

NP 

+ 

— 

— 

+ 

+ 

— 

— 

+ 

15.2 

NK 


— 

+ 

— 

— 

+ 

— 

+ 

- 8.2 

PK 

+ 

4~ 

— 

— 

— 

— 

+ 

+ 

-15.8 

NPK 

. 

4* 

+ 

+ 


— 


4- 

10.2 
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already familiar: k = 8 and the sum of the squares of the coefficients is 
also 8. Taking NP as an example, 


(15.2)’ 

( 8 ) ( 8 ) 


3.61 


This mean square is compared with experimental error by means of F : 


F = 3.61/2.190 - 1.65 ' 


The test is, of course, identical with that using t . Verification: \/l.6S 
= 1.28 — t. In this factorial experiment the t test is much easier, espe- 
cially if one uses the significant difference between sums. But as a check 
on the computations, as well as a matter of interest, the sum of the seven 
mean squares may be calculated and compared with 513.79 in table 15.10: 

(— 29.6) 8 + ■ ■ ■ + (10-2) 2 _ 

( 8 )( 8 ) 

With experience and skill in the interpretation of data from factorials, 
construction of tables like 15.11 and 15.13 may be unnecessary. Yates (12) 
devised the ingenious method shown in table 15.14. The treatment sums 


TABLE 15.14 


Yates’ Device for Computing Effects 


Treatments 

Sums 

1 

2 

3 

Effects 

a) 

102.4 

198 2 

492 7 

975 6 


n 

95 8 

294 5 

482 9 

-29.6 

N 

P 

149.3 

201.2 

-10 7 

176.8 

P 

np 

145 2 

281.7 

-18.9 

15.2 

NP 

k 

108 5 

— 6.6 

96.3 

- 9 8 

K 

nk 

92.7 

- 4 1 

80 5 

- 8.2 

NK 

pk 

142 4 

-15.8 

2 5 

-15 8 

PK 

npk 

139.3 

- 3.1 

12.7 

10.2 

NPK 

Total 

975.6 




are entered in a characteristic order, any two treatments like n and p 

being followed by np , while k is then followed by nk and pk. In column 1 

are successively: 

4 sums like 95.8 + 102.4 = 198.2 
4 differences like 95.8 — 102.4 = —6.6 
Column 2 is made from column 1 in the same fashion: 

4 sums like 294.5 + 198.2 = 492.7 

4 differences like 294.5 — 198.2 = 96.3 

Finally, column 3 is got from column 2 by the same sequence. 

The first entry in column 3 is the total yield for the experiment, 
furnishing a check on the calculations to that point. Other methods of 
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verification may be found in Yates’ article. The 7 differences may be 
tested by t or F as preferred. 

The conclusion from this experiment is that in the region of which it is 
representative phosphorous alone may be recommended to increase the 
yield of wheat. Warning: do not proceed immediately to summarize a 
factorial experiment in a table like 15.13 or 15.14. First examine every 
detail of the comparisons in order to gain all the information. 

EXAMPLE 15.24 — The 2X2 table is often effective in studying the results of 
factorial experiments. The yields from a lime-inoculation trial on legumes might look 
like this: 


No Lime Lime 


No inoculation 40 60 

Inoculated 80 200 


If the entries are the sums from five replications, show that the average interaction is 10. 

EXAMPLE 15.25 — The results of a trial of nitrogen and phosphorous in ten repli- 
cations might be like this: 


(1) p 


(1) 600 800 

n 800 900 


Show that average NP * — 5, despite the greater yields of the plots having both. 
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Chapter 16 


BINOMIAL AND POISSON DISTRIBUTIONS 

16.1 — Introduction. Small samples from these distributions were 
dealt with in chapters 1 and 9. We are now ready for the discussion of 
some features of large samples of enumeration data similar to the large 
sample methods for measurement data presented in chapter 8. Since in 
these distributions the mean and the variance are not independent, trans- 
formations may be necessary before analysis of variance is applied. Three 
of these are described in the closing section. 

16.2 — The binomial distribution. If the incidence of some event 
like survival or male sex is observed in subsamples of uniform size, the sub- 
samples may be classified according to the number of times the event 
occurs. The resulting frequency distribution may be a sample from a 
binomial distribution , a distribution in which the probability of the occur- 
rence of the event is the same for every individual. 

As an example, suppose lots (subsamples) consisting of four full sister 
chicks are inoculated with an organism producing equal chances of life 
or death for every chick. The event of survival in the lots may occur any 
number of times from none to four. If 96 lots are injected, some such record 
as this may result: 


Number of Surviving 

Chicks in Subsample 

Number of Subiamples With 

Each Record of Survivals 

0 

1 

1 

20 

2 

41 

3 

25 

4 

9 

Total 

,96 


Clearly, no survivals in a lot, as well as four survivals, Ls unlikely. The 
modal number of living chicks is two. 

The two specifications, four individuals in each subsample and 50% 
probability of survival, completely determine a binomial distribution. 
Most conveniently, the relative frequency of occurrence of 0, 1, 2, 3, and 4 
survivors is given by the successive terms of the expansion of the binomial, * 


* Look up the topics, binomial theorem , probability , sampling , permutations and combina - 
tions in any college algebra. 


[ 451 ] 
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This means that one-sixteenth of the subsamples in the population will 
have no survivals, one quarter will have one, etc. Combining this informa- 
tion with the sample fact of 96 lots, the expected frequencies in the classes are 



= 6, etc.; that is, 6, 24, 36, 24, and 6. 


The discrepancies be- 


tween these and the sample frequencies may be due to the vagaries of 
sampling or to a mistaken assumption about the probability of death 
among the chicks. These possibilities will be investigated presently. 

The general statement of the theory is this: if there is a constant prob- 
ability, />, in random subsamples of k , the relative frequency of 0, 1, 2, . . . 
occurrences is given by successive terms of the binomial 

(? + py, 

where q = 1 — p is the probability that the event will not occur; the 
expected frequencies in the classes of the sample are the products of these 
relative frequencies by rz. 

You may find it profitable to verify this theory by sampling a homoge- 
neous population. Selecting a lot of beans of uniform size, shape and 
texture, color (or otherwise mark) some fraction of them, say one-third, 
then mix thoroughly. Draw subsamples of five, for example, by some 
random method of selecting one bean at a time, noting its color then re- 
placing it among the others. Continue after re-mixing each time. The 
probability of getting 0, 1, 2, 3, 4, or 5 colored beans is taken from the 
successive terms of the expansion, p = 1/3, q — 2/3, k = 5: 



If you carry on to 243 subsamples, n = 243, the expected frequencies will 
be the numerators of the six fractions. 

Instead of beans, the random digits of table 1.2 may be used. The 
digits 3, 6, and 9 take the place of colored beans, while 1, 2, 4, 5, 7, and 8 
stand for white: 0 is discarded when drawn. By some such convention, 
any desired probability may be specified. 
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The binomial coefficients , for the formation of which you will find several 
rules, are easily remembered by writing them down this way: 


Size of Subsample 
k 


Binomial Coefficients 


1 



1 1 


2 



1 2 1 


3 


1 

3 3 1 


4 

1 


4 6 4 

1 

5 

1 

5 

10 10 5 

1 

6 

1 6 


15 20 15 

6 1 

7 

1 7 

21 

35 35 21 

7 1 

8 

1 8 28 


56 70 56 

28 8 1 

etc. 



etc. 



In this form, called Pascal's triangle , each coefficient is the sum of the two 
just above it to the right and left. The coefficients for k = 4 and k = 5 
verify those used above for the chicks and the beans. You see, the distri- 
bution of the coefficients is always symmetrical. If p = q = 0.5 the bi- 
nomial distribution is also symmetrical, the shape approximating more 
and more closely that of the normal distribution as k gets larger. If the 
probabilities are unequal, the binomial distribution is skewed. 

In the binomial distribution, as in the normal, the mean and standard 
deviation are appropriate averages. Setting up the expected numbers in the 
usual manner for calculation (section 8.2): 


Number of 
Living Chicks 

X 

Expected 

Frequency 

jx 

/X* 


0 

6 

0 

0 


1 

24 

24 

24 


2 

36 

72 

144 


3 

24 

72 

216 


4 

6 

24 

96 


Total 

96 

192 

480 



m ■* 192/96 ** 2 chicks per lot 


The symbols m and a for mean and standard deviation are used to dis- 
tinguish these population parameters from the sample statistics, x and s. 
The latter, calculated from the observed frequencies, are 2.2188 chicks per 
lot and 0.920 chick per lot: we shall have little use for these statistics. 
Another point you will notice is that for <r, division was by n = 96 instead 
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of by n — 1 =95. Degrees of freedom are appropriate for computing 
sample statistics but not parameters. 

One usually finds these averages expressed as proportions or as per- 
centages rather than as actual numbers per lot. Thus, one reads that the 
proportion of living chicks expected is m = 2/4 = 0.5 with cr = 0.25; or 
that the percentage is 50% of living chicks with standard deviation 25%. 
This emancipates the averages from the subsample size, making those for 
lots of four directly comparable with similar ones for lots of any other size. 

On account of mathematical relations connected with the binomial 
distribution, it is unnecessary to go through the calculations shown above. 
The parameters are got more easily from the formulas, 

m = kp = (4) (0.5) = 2 chicks per lot 

<r = y/kpq = \/(4)(0.5)(0.5) = 1 chick per lot 

If the parameters are to be expressed as proportions or percentages, the 
formulas are 


m — p — 0.5 = 50%, 

o- = y/jftjk = ^(01X05)74 = 0.25 = 25% 

Thus, p is not only the probability of occurrence of an event but is also 
the population mean proportion or percentage of occurrences. 

It is worth noticing that, if the subsample size were 16 chicks, the ex- 
pected mean percentage would be the same, p — 0.5 = 50%; whereas 
the expected standard deviation would be only, 

a = x/pq/k = >/ (0.5) (0.5) /1 6 = 0.125 = 12.5% 


The theoretical standard deviation, then, decreases as the subsample 
size increases. Really, a is the standard error of the subsample expected 
mean, varying inversely as the square root of the subsample size — a 
familiar relation in measurement data. 

In the whole binomially distributed sample the standard error of the 
mean is inversely proportional to the square root of the entire number of 
individuals. That is, if there are n subsamples of k individuals each, 


g x — 



Vn 


again a familiar relation. For the entire distribution of chicks, the standard 
error in percentage is 


g x — 


25 

y/96 


2.55% 


This question was proposed above: may we attribute to sampling 
variation the discrepancy between the assumed parameter, m = 50%, 
and the corresponding sample mean, x = 2.2188/4 = 55.47%? Naturally, 
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if the sample is from a binomial distribution the test involves chi-square. 
Rather than use percentages, it is simpler to ask the question directly: 
is there a significant difference between the expected number of living 
chicks and the observed number? The former is 50% of the total number 
of chicks, 50% of 384 being 192 chicks. The latter is the value of SfX in 
the sample, 

SfX = (1)(0) + (20) (1) + (41) (2) + (25) (3) + (9) (4) = 213 
In the usual manner, 

, (213 - 192) 2 , (171 - 192) 2 , cn 

* 192 192 

whose probability of occurrence is about 3.5%. The sample number of 
surviving chicks is, therefore, significantly greater than the proposed 50%. 

It is common to find t used instead of x 2 for the test. In the case under 
discussion, the tests are identical provided a is used — s might lead to an 
erroneous conclusion. To illustrate, w^e found that the standard error 
of the mean was 

= 25/\/96 = 2.55% 

The difference between the sample mean and the parameter is 

55.47 - 50 = 5.47% 

Then, t = 5.47/2.55 - 2.15, 

near its 3.5% level, d.f. — qo, just as chi-square was near its 3.5% point. 
It is interesting to notice that, in these circumstances, t = x- That is the 
reason the /-test is valid for this kind of enumeration statistics. 


EXAMPLE 16.1 — In the manner of section 8 2 calculate x and s for the sample of 
surviving chicks. 

EXAMPLE 16 2 — For the proposed sampling of beans, p =* 1/3, k «=* 5, show 
that m = 1.667 and <r ** 1.054 colored beans per subsample, or that rn =* 33.33% 
with a — 21.08%. 

EXAMPLE 16.3 — It is convenient to bear in mind that the total number of indi- 
viduals in a binomial sample is nk, and that the total number of occurrences of the 
event is SfX. Hence, £ « 100 {SJX)/{nk) percent. 

EXAMPLE 16.4 — If 243 subsamples of the beans (k — 5, p ~ 1/3) were drawn 
at random with £ — 30.37%, show that / 2 = x 2 “ 4.80. What conclusion about the 
sample would you reach? 

EXAMPLE 16.5 — You often see some such statement as this: “As the size of the 
sample is increased, the sample mean draws nearer and nearer to the population 
value.” This would require chi-square to approach zero. Do you think it would? 


As the sample increases in size, the binomial distribution approaches 
the normal, more rapidly for values of p near 0.5 than for those near 0 or 1. 
One consequence is that t can be used instead of table 1.1 for setting confi- 
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dence limits. If n = 100, for example, and the sample contains 40 indi- 
viduals with the attribute being observed, then p — 0.4, q = 0.6, and 

40 =* tWnpq = 40 =*= 1.96\/(100)(0.4)(0.6) = 40 ± 9.6 
specifies the 95% confidence interval, 30.4 — 49.6. Compare this with 
the interval, 30 — 50, given in the table. 

As another example, suppose 100 individuals having a certain attri- 
bute are drawn in a sample of 1,000. Since in the corresponding part of 
table 1.1 ratios are being used, the interval is given by 

0.1 t M \/pq]n = 0.1 =*= 1.96 V / (0.1)(0.9)/1,000 = 0.1 =*= 0.0186 

The interval, then, is from 0.0814 to 0.1186; that is, from 8.14% to 

11 . 86 %. 

16.3 — Comparison of a sample distribution with the binomial. In 

experimental data there is everpresent sampling variation to be con- 
sidered. Under the assumption of a binomially distributed population as 
the source of the sample, we have already tested the significance of the 
deviation of the sample mean from m. The question now to be answered 
is whether the entire distribution of the sample departs from expected. 
The methods are identical with those described in chapter 9. 

For illustration we use the data in table 16.1 (6). Since these are eight- 
pig litters, there are nine classes, having respectively 0, 1, ... 8 males. 

TABLE 16.1 

Eight- Pig Litters Having Various Numbers of Mai.es Expected Numbers, and 
Test of Binomial Distribution 


Number 
Males 
in Litter 
X 


Number 

of 

Litters 

/ 


Expected 

Number 


Deviation, (Deviation)* 

Number — Expected (Deviation) 2 Expected 


0 O' 

1 5 

2 9 

3 22 

4 25 

5 26 

6 14 

7 4 

8 1 


Total 


106 


14 


19 


0 

414) 



-0 

4141 

l i nn 





3 

312 

[15 

320 

1 . 

688 J 

> — 1 3ZU 

1 

.742 

0 

114 

11 

594 J 



-2 

594 






23. 

.188 



-1 

188 


1 

411 

0 

061 

28 

984 



-3, 

.984 


15 

872 

0 

548 

23. 

,188 



2 

812 


7 

907 

0 

341 

11 

594] 



2 

4061 






3 

312 

15 

320 

0 

688 

► 3 680 

13 

542 

0 

884 

0 

41 4 j 

1 


0 

586 J 






106 

000 



0 

000 




x* — 1 . 

948 


Let us first set up the hypothesis of equal probability of the sexes. Then 
p = 0.5, q = 0.5, k = 8, n = 106, and the expected frequencies are given 
by 

106 [(0.5) 8 , 8(0. 5) 7 (0.5), 28(0.5) 6 (0.5) 2 , etc.,] 

= (106)(0.5) 8 [1, 8, 28, 56, 70, etc.,] 

= 0.414, 3.312, 11.594, etc. 
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The deviations of these expected numbers from actual measure the 
departure of the sample distribution from the binomial being considered. 
Is the departure significant? The appropriate test is made by means of 
chi-square. But since each expected number should be at least as great 
as five, the first three and last three classes are pooled for the test. The 
resulting chi-square is moderate for d.f. = 4. Actually, there is somewhat 
less deviation from expected than would be found ordinarily in random 
sampling. 

Why four degrees of freedom? The sum, 106 litters, calculated from 
the sample and used in the fitting, takes one degree. Effectively, there were 
only five classes used, leaving four degrees of freedom for the test. 

From the results of the comparison, there is no evidence against the 
hypothesis of a uniform probability of 50% for maleness. This is, of 
course, not proof that the sex ratio is 1:1. It is merely that there is not a 
significant departure from that binomial distribution in which p = q. 


EXAMPLE 16.6 — Parkes (7) collected information about sex distribution from 
the National Duroc Jersey Pig Record. Among 402 eight-pig litters he recorded the 
numbers having 0, 1, 2, ... 8 males as follows: 1, 8, 37, 81, 162, 77, 30, 5, 1. Compute 
X 2 =* 38.7, df. = 6. Can you explain the significant deviation from the hypothetical 
distribution; that is, why the probability of maleness varies significantly among the 
fitters^ 

EXAMPLE 16.7 — Ignoring the nonbinomial distribution of the sample, test the 
significance of the difference between the expected number of males in the foregoing 
example, 1,608, and the number recorded in the Record, 1,581. Chi-square is 0.90. 
Does this throw any light on the sample distribution? 

EXAMPLE 16.8 — Test the significance of the deviation of the sample of chicks 
from the binomial with p = 50%. Chi-square = 7.07, df. = 4. The mean, you may 
recall, differed significantly from 50%. 


EXAMPLE 1 6 9 — Test the significance of the deviation of the sample number of 
males (439) in table 16.1 from the number expected (424) under the 50% hypothesis. 
Chi-square = 1.06, df. — 1. 


16.4 — The test of homogeneity in a distribution of the binomial 
form. Often there is no theory upon which to base a hypothesis about 
the value of p. In that event the question raised and answered in the fore- 
going section has no meaning. Instead, we may wish to know whether 
the sample is homogeneous ; that is, may it be a sample drawn from some 
binomial distribution, p unspecified? Since there is no theoretical prob- 
ability, we simply take the sample mean percentage and set up a binomial 
distribution having that value of p. Naturally we get no test of the value 
of p , but we do learn if this probability may have been uniformly dis- 
tributed over all the individuals. 

The distribution of surviving chicks furnishes a good illustration. The 
evidence is strong that the hypothesis of 50% survival was not well founded 
(section 16.2, example 16.8). May the sample be a random drawing from 
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a binomial distribution with p ~ 55.47%, the sample mean? The suc- 
cessive terms of the binomial, 

96(0.4453 + 0.5547) 4 


are the expected frequencies. These are easily calculated: 


X 

Powers of 
<7 

Powers of 

P 

Binomial 

Coefficients 

Relative 

Frequencies 

Expected 

Frequencies 

0 

0 0393 


1 

0 0393 

3.77 

1 

0 0883 

0.5547 

4 

0.1959 

18 81 

2 

0 1983 

0 3077 

6 

0 3661 

35 15 

3 

0 4453 

0.1707 

4 

0 3040 

29 18 

4 


0 0947 

1 

0 0947 

9.09 

Total 




1 0000 

96.00 


Place q = 0.4453 in the last row but one, and in the calculating machine. 
Multiply repeatedly by q until q K is reached, recording each power in the 
table. Repeat for p, entering the successive powers downward in the 
table beginning with the second row. In the fifth column each relative 
frequency is the product of the three (or two) numbers to its left. As an 
example, in the third row, 

(0.1983)(0.3077)(6) = 0.3661 

The relative frequencies must add to unity. Multiplication of each of 
them by n = 96 yields the binomially distributed frequencies for p 
= 55.47%. 

The chi-square test is shown in table 16.2. The degrees of freedom 
are now only two — not only does the sample total, n = 96, impose a 
limit on the expected frequencies, but also the sample mean has been 
made the parameter of the binomial. Nevertheless, chi-square exceeds 
its 50% point but little. There is no evidence against the constancy of 


TABLE 16.2 

Test of Homogeneity of the Chick Survivals 


X 

/ 

Expected 

Deviation 

(Deviation) 2 

Expected 

0 

1 

2 o) 21 

3 77l«« tn 
18.81 ! 22 58 

-1.58 

0.11 

2 

41 

35 15 

5 85 

0.97 

3 

25 

29 18 

-4 18 

0 60 

4 

9 

9.09 

-0.09 

0.00 


96 

96 00 

0 00 

X a * 1.68 
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TABLE 16.3 

Number of Males in Litters of Duroc Jersey Pigs 


Five-pig Litters 

Six-pig Litters 

Number 



Number 



Males in 

Number 

Expected 

Males in 

Number 

Expected 

Litter 

Litters 

Numbers 

Litter 

Litters 

Numbers 

0 

1 

2o} 22 

4.08\ oa r a 
19.46/ 23 54 

0 

1 

3^ 19 

I6f lj 

3 -621 24 oq 

21.37 r 4,yv 

2 

41 

37.07 

2 

53 

52.55 

3 

35 

35 31 

3 

78 

69 08 

4 

“l 18 

^l?} 20 05 

4 

53 

50 98 

5 

5 

6 

*o} 18 

V2I} 23 40 

Total 

116 

X* = 0.73 


221 

3.92 


p~ 48.79% 



p= 49.62% 



the probability of survival. The population may be homogeneous, with a 
uniform probability of 55.47%. 

After two samples have been found homogeneous, it is often desired 
to test the significance of the difference between their means. Assuming 
a uniform probability in each, do these probabilities differ significantly? 
In table 16.3 are the distributions of males in five- and six-pig litters. 
Each is homogeneous with respect to its own probability. Do these means 
differ significantly? The test is easily carried out in a 2 X 2 table like this: 


Litters 

Males 

Females 


Total 

Five-pig 

Six-pig 

283 

297 


580 

658 

668 


1,326 

Total 

941 

965 


1,906 


, [(263) (668) - (297)(658)]»(1906) 

X (580) (1,326) (941) (965) 

For d.f. = 1, this is a smaller value of chi-square than^usual. The small 
difference between the two percentages, 

49.62 - 48.79 = 0.83%, 

is not significant. 

If a pooled value of p , 941/1,906 = 49.3704, is used to compute the 
standard deviation, y/pq(\/n\ + 1 /n%) = 0.0249 this test may be carried 
through with t in the manner of the last section. That is, x witH one 
degree of freedom is the same as t with d.f. = oo . 

If there are three or more samples to be compared, the method of 
section 9.6 is probably the easiest. 
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What shall be done with samples that are not homogeneous? If there 
is no uniform probability of occurrence, and if nothing is known of the 
laws governing the probability, any conclusions are based on flimsy evi- 
dence. There is no assurance that, if the experiment were repeated, the 
same set of probabilities would be existent. If one doesn’t know why the 
probabilities change or in what manner, he can set up no experimental 
controls. The first step must be toward improved knowledge of the tech- 
niques of selecting and handling the experimental material. Only after 
the sources of variation are discovered may valid comparisons be made. 

Nevertheless, there are times when one wishes to make a test of signifi- 
cance of the difference between means despite heterogeneity. If means 
differ significantly even in the face of large variation, perhaps the differ- 
ence represents some population fact. The method used to make the test 
must be sensitive to the variability present. Probably the best is to calcu- 
late the standard deviation of each set in the manner of section 8.2, then 
test significance as in section 8.8. The assumption of normal distribution 
there made is more flexible than the assumption of constant p. 

You may find interest in contrasting the methods of treating the bi- 
nomial distribution and the normal. Each has two parameters. A test 
of significance of sample deviation from theoretical distribution is 
available in each. But there is a single, basic normal distribution, the 
relative frequencies being set out in table 8.6, whereas there is a separate 
binomial for each p and k. It follows that tables of the binomial distribu- 
tions would be clumsy — it is easier to compute each time from the para- 
meters. 

EXAMPLE 16.10 — Calculate chi-squarc. for each of the samples of table 16.3. 

EXAMPLE 16.11 — Decker and Andre (4) exposed adult chinch bugs at a tempa- 
ture of —8.5° C. for 15 minutes, then counted the number dead in 100 subsamplcs 
of 10 bugs. The numbers of subsamples having 0, 1, 2, ... 7 dead were 4, 21, 22, 28, 
14, 8, 2, 1 No subsample had 8 or more dead. Is there a significant lack of homogeneity 
for probability of death? Chi-square = 2 89 > dj. — 8. 

EXAMPLE 16.12 — The sample probability of maleness in example 16.6 is 49.160%. 
Is the sample homogeneous with respect to this percentage? Chi-square = 38.15, 
dj. « 5. 

EXAMPLE 16 13 — In example 16.9 the sample proportion of males in table 16.1 
was found to be not significantly different from 0.5. Is the sample homogeneous with 
respect to the probability 0 5177? Chi-square = 0.73, dj. — 3. 

EXAMPLE 16.14 — Using the methods of chapter 8, test the significance of the 
difference between the mean percentages of males in the samples of table 16 1 and 
example 16.6. Since the latter departs significantly from binomial distribution, the 
use of chi-square in a 2 X 2 table is inappropriate. Answer, t = 0.35. 

EXAMPLE 16.15 — Test the significance of the difference between the percentages 
of males in the five- and six-pig litters of tabic 16.3, using t instead of x 2 * Your result 
should bet =* x — \Ahlll. 

16.5 — The Poisson distribution. Like the binomial, this is the distri- 
bution of a discrete or discontinuous variable arising from enumeration, 
usually only the integral values occurring. Also, there is the idea of the 
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uniform probability of the occurrence of some event, but this probability, 
ordinarily small, is somewhat ill defined. In fact, the size of the sub- 
sample, though supposed to be constant, is usually not known. It is 
large, however, and must be confined within reasonably narrow limits. 

In contrast with both the binomial and the normal, the Poisson dis- 
tribution is defined by a single parameter, the mean. The variance is 
equal to the mean. 

The Poisson distribution can be shown to be a limiting form of the 
binomial in which p is very small (9). The distribution is valuable in its 
own right, however, quite independent of this relation to the binomial. 

If m is the expected mean frequency of occurrence of some event in the 
population of subsamples, and if the frequency follows the Poisson distri- 
bution, then the event will occur 0, 1, 2, . . . times in subsamples with 
relative frequencies equal to the successive terms of this sequence, 

1 m m 2 m z m A 

P P (2)(3)P (2)(3)(4)e m ’ 

where e , the base of the natural logarithm, is an irrational number whose 
value is about 2.718, and whose logarithm (base 10) is 0.434295 very 
nearly. The expected frequencies are products of these relative frequencies 
by the number of subsamples, n. 

The fitting of a Poisson distribution to a sample will be illustrated by 
the data on noxious weed seeds (5) in table 16.4. The subsamples are 
specified as weighing ounce. Naturally, the number of seeds in the 
subsample varies somewhat, but is large compared with the number of 
noxious weed seeds. The Poisson distribution, like the binomial, is built 

TABLE 16.4 

Distribution of 98 Quarter-ounce Subsamples of Phleum Pratense Classified by 
Number of Noxious Weed Seeds. Fitted Poisson Distribution. 

Test of Significance 


Numbers of 




Deviations 

(Deviations) 2 

Noxious 

Number of 

Expected 

Numbers 

From 

Weed Seeds 

Subsamples 

Expected 

Expected 


/ 




0 

3 

4 78 


-1 78 

0 663 

1 

17 

14 44 


2 56 

0 454 

2 

26 

21 81 


4 19 

0 805 

3 

16 

21 96 


-5 96 

1 618 

4 

18 

16 58 


1 42 

0 122 

5 

9 

10.02 


-1 02 

0 104 

6 

3) 

5.041 




7 

5 0 

2 18 

L 41 



8 

9 

?I 

0.82 
0 28 


0 59 

0.041 

10 


0 08 




More 


0 01 





98 98.00 0 00 x s = 3.807 


Total 



442 


Statistical Methods 


Section 16.5 


on the assumption that the probability that any seed is that of a noxious 
weed is constant throughout the population. We shall first fit the theo- 
retical distribution, then test the significance of the deviation of the sample. 

The first step is the calculation of the sample mean in the manner of 
section 8.2: 

ot = {SfX)/(Sf) = 296/98 = 3.02041 noxious weed seeds per subsample 

We then set up a Poisson distribution with this mean as its parameter, 
m. A distribution with some theoretical mean might be fitted, but that 
would be unusual. Ordinarily the sample mean is used. We are testing 
homogeneity only — is there a uniform probability of occurrence of weed 
seeds throughout the sample? 

The remainder of the calculation below is by means of logarithms. 
The successive steps are rather evident. After the logarithms of the first 
two expected numbers are reached, those of the successive numbers are 
got by (i) adding the logarithm of m, and (ii) subtracting the logarithm 
of the successive integers beginning with 2. You can see this by writing 
the expected numbers in this way: 

n nm ( nm\ ( m\ l nm 2 \ / m \ 

7*' \^)\2f \ 2e™ ) \ 3 / 


The calculations proceed thus: 


Symbol 

Logarithm 

Expected Number 

n =* 98 

t m (3 02041) (0.434295) 

n/e m 

1 99123 
- 1 31175 

0 67948 

4 78 

m « 3 02041 

0 48007 


mn/e m 

1 15955 

14 44 

m 

0 48007 


2 

1 63962 

0 30103 


mPn/le'* 

1 33859 

21 81 

m 

0 48007 


3 

T 81866 

0 47712 


m*n/ (2) (3)*" 

1 34154 

21 96 

m 

0 48007 


4 

1 82161 

0 60206 


m 4 n/(2)(3)(4V" 

1 21955 

16 58 

m 

0 48007 


5 

1 69962 

0 69897 

1 00065 

10 02 
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m 

6 

0.48007 

1.48072 

0.77815 

0.70257 


5.04 

m 

0 48007 




1 18264 



7 

0 84510 




0 33754 


2 18 

m 

0 . 48007 




10 81761 - 

10 


8 

0 90309 




9 91452 - 

10 

0 82 

m 

0.48007 




10 39439 - 

10 


9 

0 95424 




9 44035 - 

10 

0 28 

m 

0 48007 




9 92042 - 

10 


10 

1 




8.92042 - 

10 

0.08 

Total 



97.99 

From 

the logarithm of 98 is subtracted the logarithm of e m which is 

m\og e 

= (3.02041) (0.434295). The difference is 

the logarithm of the 


quotient, the anti-logarithm being the expected number of subsamples 
containing no weed seeds. Next is added the logarithm of 3.02041 to get 
the logarithm of the next expected number, 14.44. Thereafter the loga- 
rithm of m is added each time while the logarithm of the next higher 
integer is subtracted. The process continues till the expected numbers 
become smaller than 5. We carried on to illustrate the process, but had 
to recombine for the test in table 16.4. 

Chi-square is computed as usual. How many degrees of freedom are 
there? The sample mean has been set up as the parameter of the distribu- 
tion, and the sample number of subsamples, 98, is used to calculate the 
expected numbers remaining after the fifth (or tenth) class. Hence two 
degrees of freedom must be deducted from the effective number of 
classes, leaving dj. = 5 for testing chi-square. Clearly, the sample does 
not depart significantly from the Poisson distribution. The 98 bags of 
seeds from which the subsamples were taken may constitute a population 
with Poisson distribution. The probability that any randomly selected 
seed be that of a noxious weed may be uniform throughout. 

The variance in the theoretical distribution is the same as the mean, 
3.0204. Hence s — \/3.0204 = 1.738 seeds per subsample. Obviously, 
the standard deviation has different relations to the Poisson distribution 
than to the normal. For example, it cannot be said that 68.27% of the 
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individuals in the unsymmetrical Poisson distribution lie within the inter- 
val, m a. However, if the entire sample of 98 quarter ounces be con- 
sidered as a unit with 296 weed seeds, it may be thought of as a single 
sample drawn from a population whose mean is estimated as 296 and whose 
variance is therefore 296. Poisson populations with such large means are 
distributed almost normally. Hence, for practical purposes this large 
sample may be treated as though drawn from a normal population whose 
mean is estimated as x = 296 with standard error, s& = \/296 = 17.2 
weed seeds. The numbers of weed seeds in two such samples may then be 
compared, the difference being tested for significance in the manner of 
the next section. 


EXAMPLE 16.16 — Leggatt counted various other weed seeds in his 98 subsamples. 
Numbers of two genera follow: 



Number of Seeds in Subsamplc 



Genus 

0 

1 

2 

3 

4 5 6 

7 

Chi-square 

P 

Potentilla ! 

37 

32 

16 

9 

2 0 1 

1 

2.931 

24% 

Carex 

36 

36 

23 

2 

1 


4.476 

H% 


EXAMPLE 16.17 — The chinch bug data of example 16.11 are well adapted to 
binomial fitting. It may be interesting to apply the Poisson distribution. Chi-square 
- 4.634, df. ** 5. ' 

EXAMPLE 16.18 — If you have the book available, look at table 51 in Pearson’s 
(8) collection and compare the distribution for m = 3.0 with that of table 16.4. Make 
a rough check on your work in example 16.17 by use of the same table. 


EXAMPLE 16.19 — If m is large, the Poisson distribution is approximately normal. 
To verify this, work out the distribution for m = 1 5, or take it from Pearson’s table. 
Group the expected frequencies for n = 100 as follows: 


Class mark 

3 

6 

9 

12 

15 

18 

21 

24 

27 

Frequency 

1 

1 

10 

24 

30 

22 

9 

2 

1 


Using the method of section 8 6, show that g\ = 0.094 =*= 0.24 and g 2 553 0.46 =*= 0.48. 


EXAMPLE 16.20 — The sample of table 16.4 may be looked upon as a single 
subsample of 24.5 ounces drawn from a Poisson population with m =» 296 noxious 
weed seeds. The variance of this mean is 296. Comparing this with the foregoing 
example, you see why large means from Poisson populations may be considered as 
normally distributed. 


16.6 — Comparison of means in samples from Poisson distributions. 

A moment’s thought will make it clear that means from Poisson distribu- 
tions cannot be compared like those from the binomial. The records give 
the number of times the event occurs but not the times it fails. For small 
samples a chi-square method of testing was explained in section 9.12. But 
since large sample means drawn from a Poisson population are distributed 
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almost normally, it turns out that the appropriate method of testing differ- 
ences is that of section 8.8. An example will recall the method. 

“Student” (9) gave two distributions of yeast cells counted in the 400 
squares of a haemacytometer, table 16.5. The values of chi-square show 


TABLE 16.5 

Two Samples of Yf.ast Cells Counted in 400 Squares of a Haemacytometer 
Fitted Poisson Distributions 



Number Squares Having Specified Number of Cells 


Chi- 

square 

Sample 

0 

1 

2 

3 

4 

5 6 7 

8 9 

Mean 

1 

Observed 103 

143 

98 

42 

8 

4 2 


529 

3.177 


Expected 106 

141 

93 

41 

14 

5 




2 

Observed 75 

103 

121 

54 

30 

13 2 1 

0 1 

720 

7.073 


Expected 66 

119 

107 

64 

29 

15 





that they may well be samples from Poisson distributions. The difference 
between the means is 720 — 529 — 191 yeast cells per sample. To test 
significance, we have the two variances, the same as the means, 720 and 
529. Prom these, the standard error of the mean difference is 

v/720 + 529 = 35.3 

Hence, / = 191/35.3 = 5.41, strong evidence that the two samples are not 
drawn from the same population. 

EXAMPLE 16 21 — Calculate the expected numbers in table 16.5, together with 
the values of chi-square. 

EXAMPLE 16 22 — “Student” recorded one sample of yeast cells with this distri- 
bution 


Number of yeast cells 

0 

1 

2 

3 4 

5 

Number of squares 

213 

128 

37 

18 3 

1 


Compute x 2 = 9.683, d f. — 2. This departs significantly from Poisson distribution. 

EXAMPLE 16.23 — Using the methods of chapter 8, test the significance of the 
difference between the means of the two samples in table 16.5. You would be assuming 
that mean differences are normally distributed, but you would avoid the formula, 
£ = j 2 . Although this formula is justified in the samples of table 16 5, it would not be 
appropriate if the samples departed significantly from Poisson distribution. Ans., 
t ~ 5 30. 

EXAMPLE 16.24 — In table 16.5, carry through the test of significance by com- 
paring the mean numbers of yeast cells per square , 1.3225 and 1.800. You should reach 
the same value of t. 

16.7 — Transformations of data for test of significance. In a binomial 
distribution the variance, kpq or kp( 1 — p ), is clearly related to the mean, 
kp\ while in the Poisson the two are equal. In measurement data, also, it 
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is by no means unusual to come across groups in which similar relations 
appear, or in which the mean and standard deviation are proportional. 
If tests of significance by use of t or F are to be applied, it may be desirable 
to transform such variates into new sets having the mean and variance 
independent (section 11.17). To meet the situations which ordinarily 
occur suitable transformations have been devised (1) (2). 

Consider first an investigation in which the occurrence of some event 
is being counted, as for example, the number of wireworms found in an 
experimental plot or under a square foot of sod. If the counts run 50 or 
more per plot, they may be analyzed usually without change. Examine 
the ranges in the several treatments, however, to detect any tendency for 
the variance to be proportional to the mean. Counts of less than 50 per 
plot are likely to be noticeably distributed in the Poisson fashion. The 
appropriate transformation is the square root of the count. Make a new 
table in which the square root of the number of wireworms is entered 
instead of the actual count, then proceed as usual. 

In experiments where 10 or less are counted in the plots, the square 
root tends to over-correct, but \/X + 0.5 may be used, X being the ob- 
served number. This method was followed in testing the data of table 
16.6 (3). 


TABLE 16.6 

Number of Wireworms in the Plots of a Latin Square, the Treatments Being 
a Check (O) and Four Different Soil Fumigations Applied in the Previous Year 





Columns 



Rows 

1 

2 

3 

4 

5 

Treatment 

P 

0 

N 

K 

A/ 

Wireworms, X 

3 

2 

5 

1 

4 

VX~+ 0.5 

1 87 

1 58 

2 34 

1 22 

2 12 

Treatment 

M 

K 

O 

N 

P 

Wireworms, X 

6 

0 

6 

4 

4 

y/X + 0.5 

2 55 

0 71 

2 55 

2 12 

2 12 

Treatment W 

0 

M 

K 

P 

N 

Wireworms, X 

4 

9 

1 

6 

5 

y/'X + 0.5 

2.12 

3 08 

1 22 

2 55 

2 34 

Treatment 

M 

P 

M 

0 

K 

Wireworms, X 

17 

8 

8 

9 

0 

y/X + 0.5 

4 18 

2 92 

2.92 

3 08 

0 71 

Treatment 

K 

N 

P 

M 

0 

Wireworms, X 

4 

4 

2 

4 

8 

y/X~+~05 

2.12 

2.12 

1 58 

2.12 

2.92 

Treatments 

0 

N 

K 

P 

M 

Means of sq uare roots 2.45 

2 62 

1 20 

2.21 

2.56 



Section 16.7 


Binomial and Poisson 


447 


Analysis of Variance 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Rows 

4 

2 4812 


Columns 

4 

0.9026 


Treatments 

4 

6.8747 

1.719 

Error 

12 

4.4551 

0.371 


F - 1.719/0.371 - 4.63* 


It is clear that K was the treatment responsible for significance. In 
applying the methods of chapter 15, use the sums and sums of squares of 
the square roots, not of the original numbers. But in presenting the 
results, the means should be changed back to numbers of wireworms. 
Thus, the appropriate mean number for 0 is 

(2.45) 2 — 0.5 = 5.50 wireworms per plot 

Do not attempt to apply any single standard error to the resulting mean 
numbers because each has its own. Carry out tests of significance with 
the transformed means. 

To acquire judgment about transformations, analyze the original data 
and compare the two results. 

The second transformation to be presented is one appropriate to the 
kind of data associated with the binomial — so many occurrences among a 
definite total counted, or percentage of occurrences. For illustration wc 
use the data in table 16.7, courteously provided by Dr. J. C. Gaines of 
the Texas Agricultural Experiment Station. The counts of punctured 
squares were first converted to percentages. Since the numbers of punc- 
tured squares were all less than 100, a transformation is advisable. The 
one to be used converts percentages into angles, 

angle — arc sin V percentage 

For convenience wc give this transformation in table 16.8. As an example 
of its operation, let us get out the angle for 17.4%. Look for 17 in the left 
hand column and for 4 in the top row, reading the angle 24.65°. Again, 
take the percentage 0.56. In row 0.5 and column 6 read 4.29°. 

If most or all of your percentages are above 50%, you may prefer to 
work with the other of the two conjoint events ( free instead of infested , for 
example). 

The analysis of variance of the angles shows that the treatments dif- 
fered in their control of the boll weavil. However, treatments A, B , and C 
(different concentrations of calcium arsenate) had about equal effects, 
while D (calcium arsenate, Paris green and sulphur) was almost as ineffec- 
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tive as no treatment, E. The average percentages of squares punctured 
are 14.0, 12.3, 15.4, 24.7, and 30.6, easily read from table 16.8 corre- 
sponding to the mean angles shown in table 16.7. 

It should be observed that conclusions would have been the same had 
the original data been used without transformation. 

The third (and last) case to be discussed is that in which the group 
means and standard deviations tend to be proportional — the coefficients 
of variation are much the same. The proper transformation is to loga- 
rithms whether the data are counts, percentages or measurements. The 
method is illustrated by the catch of plankton nets at the left side of 

TABLE 16.7 

Percentage of Squares Punctured by Bom Weavils on Untreated Cotton 
Plants ( E ) and on Plants Dusted With Four Materials. Latin Square Layout. 

Two Hundred Squares per Plot Examinfd. Also, Corresponding 
Angies in Degrefs 





Columns 



Rows 

1 

2 

3 

4 

5 

Treatment 1 

C 

E 

A 

D 

B 

Percentage 

27 

42 

18 

34 

17 

Angle 

31 3 

40 4 

25.1 

35 7 

24 4 

Treatment 2 

A 

D 

E 

B 

C 

Percentage 

18 

27 

42 

14 

12 

Angle 

! 25 1 

31.3 

40 4 

22.0 

20 3 

Treatment 3 

D 

B 

C 

E 

A 

Percentage 

23 

14 

17 

25 

14 

Angle 

28 7 

22 0 

24 4 

30 0 

22 0 

Treatment 4 

E 

A 

B 

C 

D 

Percentage 

24 

10 

8 

12 

26 

Angle 

29 3 

18 4 

16 4 

20 3 

30 7 

Treatment 5 

B 

C 

D 

A 

E 

Percentage 

9 

11 

15 

11 

22 

Angle 

17 5 

19 4 

22 8 

19 4 

28 0 

Treatment 

A 

B 

C 

D 

E 

Mean angle 

22 0 

20 5 

23.1 

29.8 

33 6 


Analysis of Variance of Angles 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Rows 

4 

311 3 

77.82 

Columns 

4 

6.0 

1.50 

Treatments 

4 

637.5 

159.38** 

Error 

12 

108 8 

9 07 
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TABLE 16.8 

Angles Corresponding to Percentages, Angle «* Arc sin \/ Percentage, 
as Given by C. I. Buss (2) 


% 

C 

> 

1 

2 

3 

A 

I- 


> 

6 

1 

t 

8 

$ 

> 

0.0 

0 


0. 

57 

0 

81 

0 

99 

1 

15 

1 

28 

1 

40 

1 . 

52 

1 . 

62 

1 . 

72 

0 1 

1 

81 

1 

90 

1 

99 

2 

07 

2 

14 

2 

22 

2, 

29 

2 

36 

2 

43 

2. 

50 

0.2 

2 

56 

2. 

63 

2 

69 

2 

75 

2 

81 

2, 

87 

2 

92 

2 

98 

3 

03 

3. 

09 

0 3 

3 

14 

3 

19 

3 

24 

3 

29 

3 

34 

3 

39 

3 

44 

3 

49 

3 

53 

3. 

58 

0 4 

3 

63 

3 

67 

3 

72 

3. 

76 

3. 

80 

3 

85 

3 

89 

3. 

93 

3 

97 

4 

01 

0 5 

4 

05 

4 

09 

4 

13 

4 

17 

4 

21 

4 

25 

4 

29 

4 

33 

4 

37 

4 

40 

0 6 

4 

44 

4 

48 

4 

52 

4 

55 

4 

59 

4 

62 

4 

66 

4 

69 

4 

73 

4 

76 

0 7 

4 

80 

4 

83 

4 

87 

4 

90 

4 

93 

4 

97 

5 

00 

5 

03 

5 

07 

5 

10 

0 8 

5 

13 

5, 

16 

5 

20 

5 

23 

5, 

26 

5. 

29 

5 

32 

5 

35 

5 

38 

5 

41 

0 9 

5. 

44 

5. 

47 

5 

50 

5 

53 

5 

56 

5 

59 

5 

62 

5 

65 

5. 

68 

5 

71 

1 

5 

74 

6 

02 

6 

29 

6 

55 

6. 

80 

7 

04 

7 

27 

7 

49 

7. 

71 

7 

92 

2 

8 

13 

8 

33 

8 

53 

8 

72 

8. 

91 

9 

10 

9 

28 

9 

46 

9 

6.3 

9 

81 

3 

9 

98 

10 

14 

10 

.31 

10. 

47 

10 

63 

10 

78 

10 

94 

11 

09 

11 

24 

11 

39 

4 

11 

54 

11 

68 

11 

83 

11. 

97 

12 

11 

12. 

25 

12. 

39 

12. 

52 

12 

66 

12 

79 

5 

12 

92 

13 

05 

13 

18 

13 

31 

13 

44 

13 

56 

13 

69 

13 

81 

13 

94 

14 

06 

6 

14 

18 

14 

30 

14 

42 

14 

54 

14 

65 

14 

77 

14 

89 

15 

00 

15 

12 

15 

23 

7 

15 

34 

15 

45 

15 

56 

15, 

,68 

15 

79 

15 

89 

16 

00 

16 

11 

16 

22 

16 

32 

8 

16 

43 

16, 

54 

16 

64 

16 

74 

16 

85 

16 

95 

17 

05 

17 

16 

17 

26 

17 

36 

9 

17, 

,46 

17 

56 

17 

66 

17 

76 

17. 

85 

17 

95 

18 

05 

18 

15 

18 

24 

18 

34 

10 

18 

44 

18 

53 

18 

63 

18 

72 

18 

81 

18 

91 

19 

00 

19 

09 

19 

19 

19 

28 

11 

19 

37 

19 

46 

19 

55 

19 

64 

19 

73 

19 

82 

19 

91 

20 

00 

20 

09 

20 

18 

12 

20 

27 

20 

36 

20 

44 

20 

53 

20 

62 

20 

70 

20 

79 

20 

88 

20 

96 

21 

05 

13 

21 

13 

21 

22 

21 

30 

21 

39 

21 

47 

21 

56 

21 

64 

21 

72 

21 

81 

21 

89 

14 

21 

97 

22 

06 

22 

14 

22 

22 

22. 

30 

22 

38 

22 

46 

22. 

.55 

22 

63 

22 

71 

15 

22 

79 

22 

87 

22 

95 

23 

03 

23 

11 

23 

19 

23 

26 

23 

34 

23 

42 

23 

50 

16 

2 $ 

58 

23 

66 

23 

73 

23 

81 

23 

89 

23 

97 

24 

04 

24 

12 

24 

20 

24 

27 

17 

24 

35 

24 

43 

24 

50 

24 

58 

24 

65 

24 

73 

24 

80 

24 

88 

24 

95 

25 

03 

18 

25 

10 

25 

18 

25 

25 

25 

33 

25 

40 

25 

48 

25 

55 

25 

62 

25 

70 

25 

77 

19 

25 

84 

25 

92 

25 

99 

26 

06 

26 

13 

26 

21 

26 

28 

26 

35 

26 

42 

26 

49 

20 

26 

56 

26 

64 

26 

71 

26 

78 

26 

85 

26 

92 

26 

99 

27 

06 

27 

1.3 

27. 

20 

21 

27 

28 

27 

35 

27 

42 

27 

49 

27 

56 

27 

63 

27 

69 

27 

76 

27 

83 

27. 

90 

22 

27 

97 

28 

04 

28 

11 

28 

18 

28 

25 

28 

32 

28 

38 

28 

45 

28 

52 

28 

59 

23 

28 

66 

28 

73 

28 

79 

28 

86 

28 

93 

29 

00 

29 

06 

29 

13 

29 

20 

29 

27 

24 

29 

33 

29 

40 

29 

47 

29 

53 

29 

60 

29 

67 

29 

73 

29 

80 

29 

87 

29. 

93 

25 

30 

00 

30 

07 

30 

13 

30 

20 

30 

26 

30 

33 

30 

40 

30 

46 

30 

53 

30 

59 

26 

30 

66 

30 

72 

30 

79 

30 

85 

30 

92 

30 

98 

31 

05 

31 

11 

31 

18 

31 

24 

27 

31 

31 

31 

37 

31 

44 

31 

50 

31 

56 

31 

63 

31 

69 

31 

76 

31 

82 

31 

88 

28 

31 

95 

3? 

01 

32 

08 

32 

14 

32 

20 

32 

27 

32 

33 

32 

39 

32 

4i 

32 

52 

29 

32 

58 

32 

65 

32 

71 

32 

77 

32 

83 

32 

90 

32 

96 

33 

02 

33 

09 

33, 

15 

30 

33 

21 

33 

27 

33 

34 

33 

40 

33 

46 

33 

52 

33 

58 

33 

65 

33 

.71 

33 

77 

31 

33 

83 

33 

89 

33 

.96 

34 

02 

34 

08 

34 

14 

34 

20 

34 

27 

34 

33 

34 

39 

32 

34 

45 

34 

51 

34 

57 

34 

63 

34 

70 

34 

76 

34 

82 

34 

88 

34 

94 

35 

00 

33 

35 

06 

35 

12 

35 

18 

35 

24 

35 

30 

35 

37 

35 

43 

35 

49 

35 

55 

35 

61 

34 

35 

67 

35 

73 

35 

79 

35 

85 

35 

91 

35 

97 

36 

03 

36 

09 

36 

15 

36 

21 

35 

36 

27 

36 

33 

36 

39 

36 

45 

36 

51 

36 

57 

36 

63 

36 

69 

36 

75 

36 

81 

36 

36 

87 

36 

93 

36 

99 

37 

05 

37 

11 

37 

17 

37 

23 

37 

2.9 

37 

35 

37 

41 

37 

37 

47 

37 

52 

37 

58 

37 

64 

37 

70 

37 

76 

37 

82 

37 

88 

37 

94 

38 

00 

38 

38 

06 

38 

12 

38 

17 

38 

23 

38 

29 

38 

35 

38 

41 

38 

.47 

38 

53 

38 

59 

39 

38 

65 

38 

70 

38 

76 

38 

82 

38 

88 

38 

94 

39 

00 

39 

06 

39 

11 

39 

17 

40 

39 

23 

39 

29 

39 

35 

39 

41 

39 

47 

39 

52 

39 

58 

39 

64 

39 

70 

39 

76 

41 

39 

82 

39 

87 

39 

93 

39 

99 

40 

05 

40 

11 

40 

16 

40 

22 

40 

28 

40 

34 

42 

40 

40 

40 

46 

40 

51 

40 

57 

40 

63 

40 

69 

40 

74 

40 

80 

40 

86 

40 

92 

43 

40 

98 

41 

03 

41 

09 

41 

15 

41 

21 

41 

27 

41 

32 

41 

38 

41 

44 

41 

50 

44 

41 

55 

41 

.61 

41 

67 

41 

73 

41 

.78 

41 

.84 

41 

.90 

41 

96 

42 

02 

42 

07 

45 

42 

13 

42 

19 

42 

25 

42 

30 

42 

36 

42 

42 

42 

48 

42 

.53 

42 

59 

42 

65 

46 

42 

71 

42 

76 

42 

82 

42 

88 

42 

94 

42 

99 

43 

05 

43 

11 

43 

17 

43 

22 

47 

43 

28 

43 

34 

43 

39 

43 

45 

43 

51 

43 

57 

43 

62 

43 

68 

43 

.74 

43 

80 

48 

43 

85 

43 

91 

43 

97 

44 

03 

44 

08 

44 

14 

44 

20 

44 

25 

44 

31 

44 

37 

49 

44 

43 

44 

48 

44 

54 

44 

60 

44 

66 

44 

71 

44 

.77 

44 

83 

44 

89 

44 

94 


(Continued next page) 
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TABLE 16.8— (Continued) 

Angles Corresponding to Percentages, Angle * Arc sin\/ Percentage, 
as Given by C. I. Bliss (2) 


% 0 1 2 3 4 5 6 7 8 9 


50 


45, 

,00 

45.06 

45 

11 

45. 

17 

45, 

,23 

45 

29 

45 

34 

45 

40 

45 

46 

45 

52 

51 


45. 

,57 

45 63 

45 

69 

45 

75 

45 

80 

45, 

,86 

45 

92 

45 

97 

46 

03 

46 

09 

52 


46 

15 

46 20 

46 

26 

46 

32 

46 

38 

46 

43 

46 

49 

46 

55 

46 

61 

46 

66 

53 


46 

72 

46 78 

46 

83 

46 

89 

46 

95 

47 

01 

47 

06 

47 

12 

47 

18 

47, 

.24 

54 


47. 

.29 

47 35 

47 

41 

47 

47 

47 

52 

47 

58 

47 

64 

47 

70 

47, 

.75 

47. 

.81 

55 


47 

87 

47.93 

47 

98 

48 

04 

48 

10 

48 

16 

48 

22 

48 

27 

48 

33 

48. 

,39 

56 


48 

45 

48 50 

48 

56 

48 

62 

48 

68 

48 

73 

48 

79 

48 

85 

48 

91 

48 

97 

57 


49 

02 

49 08 

49 

14 

49 

20 

49 

26 

49 

31 

49 

37 

49 

43 

49 

49 

49 

54 

58 


49 

60 

49 66 

49 

72 

49 

78 

49 

84 

49 

89 

49 

95 

50 

01 

50 

07 

50, 

13 

59 


50 

18 

50 24 

50 

30 

50 

36 

50 

42 

50 

48 

50 

53 

50 

59 

50 

65 

50 

71 

60 


50 

77 

50 83 

50 

89 

50 

94 

51 

00 

51 

06 

51 

12 

51 

18 

51 

24 

51 

30 

61 


51 

35 

51 41 

51 

47 

51 

53 

51 

59 

51 

65 

51 

71 

51 

77 

51 

83 

51 

88 

62 


51 

94 

52 00 

52 

06 

52 

12 

52 

18 

52 

24 

52 

30 

52 

36 

52 

42 

52 

48 

63 


52 

53 

52 59 

52 

65 

5? 

71 

52 

77 

52 

83 

52 

89 

52 

95 

53 

01 

53 

07 

64 


53 

13 

53 19 

53 

25 

53 

31 

53 

37 

53 

43 

53 

49 

53 

55 

53 

61 

53 

67 

65 


53 

73 

53 79 

53 

85 

53 

91 

53 

97 

54 

03 

54 

09 

54 

15 

54 

21 

54 

27 

66 


54 

.33 

54 39 

54 

45 

54 

51 

54 

57 

54 

63 

54 

70 

54 

76 

54 

82 

54 

88 

67 


54 

94 

55 00 

55 

06 

55 

12 

55 

18 

55 

24 

55 

30 

55 

37 

55 

43 

55 

49 

68 


55. 

.55 

55 61 

55 

67 

55 

73 

55 

80 

55 

86 

55 

92 

55 

98 

56 

04 

56 

11 

69 


56 

17 

56 23 

56 

29 

56 

35 

56 

42 

56 

48 

56 

54 

56 

60 

56 

66 

56, 

,73 

70 


56 

79 

56 85 

56 

91 

56 

98 

57 

04 

57 

10 

57 

17 

57 

23 

57 

29 

57 

35 

71 


57 

42 

57.48 

57 

54 

57 

61 

57 

67 

57 

73 

57 

80 

57 

86 

57 

92 

57 

99 

72 


58 

05 

58 12 

58 

18 

58 

24 

58 

31 

58 

37 

58 

44 

58 

50 

58 

56 

58 

63 

73 


58 

69 

58 76 

58 

82 

58 

89 

58 

95 

59 

02 

59 

08 

59 

15 

59 

21 

59 

28 

74 


59 

34 

59 41 

59 

47 

59 

54 

59 

60 

59 

67 

59 

74 

59 

80 

59 

87 

59 

93 

75 


60 

00 

60 07 

60 

13 

60 

20 

60 

27 

60 

33 

60 

40 

60 

47 

60 

53 

60 

60 

76 


60 

67 

60 73 

60 

80 

60 

87 

60 

94 

61 

00 

61 

07 

61 

14 

61 

21 

61 

27 

77 


61 

34 

61.41 

61 

48 

61 

55 

61 

62 

61 

68 

61 

75 

61 

82 

61 

89 

61 

96 

78 


62 

03 

62 10 

62 

17 

62 

24 

62 

31 

62 

37 

63 

44 

62 

51 

62 

58 

62 

65 

79 


62 

72 

62 80 

62 

87 

62 

94 

63 

01 

63 

08 

63 

15 

63 

22 

63 

29 

63 

36 

80 


63. 

44 

63 51 

63 

58 

63, 

65 

63 

72 

63 

79 

63 

87 

63 

94 

64 

01 

64 

08 

81 


64. 
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TABLE 16.9 

Estimated Numbers of Four Kinds of Plankton Caught in Six Hauls 
With Each of Two Nets 


Estimated Numbers 


Logarithms 


Count 

I 


II 

III 

IV 

I 

I 

III 

IV 

1 i 

895 

1,520 

43,300 

11,000 

2.95 

3.18 

4 64 

4.04 

2 ! 

540 

1,610 

32,800 

8,600 

2.73 

3 21 

4.52 

3.93 

3 1 

1,020 

1 

,900 

28,800 

8,260 

3 01 

3 28 

4 46 

3 92 

4 1 

470 

1 

,350 

34,600 

9,830 

2 67 

3.13 

4.54 

3.99 

5 

428 


980 

27,800 

7,600 

2 63 

2 99 

4.44 

3 88 

6 

620 

1,710 

32,800 

9,650 

2.79 

3 23 

4 52 

3 98 

7 j 

760 

1,930 

28,100 

8,900 

2 88 

3 29 

4 45 

3 95 

8 

537 

1 

,960 

18,900 

6,060 

2 73 

3 29 

4.28 

3.78 

9 

845 

1,840 

31,400 

10,200 

2 93 

3.26 

4.50 

4 01 

10 

1,050 

2,410 

39,500 

15,500 

3 02 

3 38 

4.60 

4.19 

11 

387 

1,520 

29,000 

9,250 

2 59 

3 18 

4.46 

3.97 

12 

497 

1,685 

22,300 

7,900 

2 70 

3 23 

4.35 

3 90 

Mean 

671 

1,701 

30,775 

9,396 

2 802 

3 221 

4 480 

3.962 

Range 

663 

1 

,480 

24,400 

9,440 

0 43 

0.39 

0 36 

0.41 




Analysis of Variance of Logarithms 







Degrees of 

Sum of 


Mean 

Source of Variation 


Freedom 


Squares 


Square 

Kind of plankton 



3 


20 2070 

6 7357 

Haul 




11 


0.3387 

0 0308 

Discrepance 



33 


0 2300 

0 0070 


table 16.9 (10). You will observe the pronounced regression of mean on 
range. The corresponding logarithms have approximately equal ranges, 
clearly independent of the means. 

The appropriate means for the four kinds of plankton (i.e., the geo- 
metric means) are the antilogarithms of 2.802, etc.; that is, 634, 1,663, 
30,200, and 9,162. 

The standard deviation of the logarithms is \/0.0070 = 0.0837, that 
of the numbers being the antilogarithm, 1.21. Quoting Winsor and Clarke 
(page 5), “Now a deviation of 0.0837 in the logarithm of the catch means 
that the catch has been multiplied (or divided) by 1.21. Hence we may 
say that one standard deviation in the logarithm corresponds to a per- 
centage standard deviation, or coefficient of variation, of 21% in the 
catch. 55 


REFERENCES 

1. M. S. Bartlett. Journal of the Royal Statistical Society Supplement, 3:68 (1936). 

2. C. I. Buss. Plant Protection, No. 12, Leningrad (1937). 

3. W. G. Cochran. The Empire Journal of Experimental Agriculture, 6:157 (1938). 



452 


Statistical Methods 


Section 16.7 


4. George C. Decker and Floyd Andre. Iowa State College Journal of Science, 

10:403 (1936). 

5. C. W. Leggatt. Comptes rendus de 1’ Association Internationale d’Essais de 

Semences, 5:27 (1935). 

6. Hugh C. McPhee. Journal of Agricultural Research, 34:715 (1927). 

7. A. S. Parkes. Biometrika, 15:373 (1923). 

8. Karl Pearson. Tables for Statisticians and Biometricians, University Press, 

Cambridge, England (1924-1931). 

9. “Student.” Biometrika, 5:351 (1907). 

10. C. P. Winsor and G. L. Clarke. Sears Foundation: Journal of Marine Research, 
3:1 (1940). 



Chapter 17 

DESIGN AND ANALYSIS OF SAMPLINGS 


17.1 — Introduction. Having read the preceding chapters, you will 
readily agree that sampling is the central problem in statistics. From 
some population a sample of individuals is drawn. This sample is ex- 
amined in detail, such statistics as mean, standard deviation, and regres- 
sion coefficients being calculated. From these sample facts inductions 
about the population parameters (tests of significance and fiducial limits) 
are made. That is the routine procedure which you have been following 
throughout this book. 

In chapters 1 and 3, having actually drawn a large number of samples 
you verified some of the better known mathematical theorems about 
them — the standard error of the mean, the distributions of x 2 and l, etc. 
This led you to place confidence in other distribution theory presented in 
later chapters. 

Finally, you have been provided with a number of the more common 
experimental designs, incorporating accepted principles of sampling. 
Well conducted experiments with standard designs may be expected 
to yield reliable information about the sampled populations. 

We now come to a discussion of some rather general problems in sam- 
pling, with particular applications to samplings made by interview, by 
questionnaire, or by examination of limited portions of extensive areas. 
It will be clear, however, that the principles involved are in no wise dif- 
ferent from those which have governed the methods already presented. 

17.2 — Populations. Specification of a population is not always easy. 
Mathematically, it is only necessary to fix certain parameters. As an 
example, its mean and standard deviation completely determine a normal 
population. But what population arc you sampling when you do a ran- 
domized block experiment on corn yield? We think of this population as 
distributed in normal or near-normal fashion, and consisting of all yields 
of the varieties being tested wherever they might be planted on similar 
soil and with similar environmental conditions. It may even be that a 
population of this kind is wholly imaginary, as when we are testing some 
newly developed hybrid of which there is no surplus of seed, so that only 
the sample exists. Populations like these, indefinite in extent and location, 
need careful description. The investigator specifies them by giving details 
of his experimental conditions — soil type, fertilization, previous manage- 
rs] 
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ment, and seasonal characteristics such as humidity, temperature, etc. 
An example of a somewhat more definite population is that of four-year- 
old Guernsey milk-cows in Waukesha County, Wisconsin. The individuals 
of this kind of population, finite in number and confined to a limited 
area, may conceivably be enumerated in census fashion. 

These instances illustrate the variety of concepts that may be asso- 
ciated with the word, population. A more extensive discussion may be 
found in (11). Before undertaking a sampling, one must clearly delimit 
the population about which he expects to seek information. After the 
sample is summarized, he must be careful to localize his conclusions to 
the population actually sampled. 

A population is said to be homogeneous if it has a single set of para- 
meters. For example, the death rate of 30-year-old native born white 
males in Iowa would constitute a homogeneous population if every mem- 
ber of it were subject to the same probability of death, p. Again, farm 
income from a given size and type of owner farm in the Cash Grain Dis- 
trict would be a homogeneous population if its distribution could be 
described by some such set of parameters as mean, standard deviation, 
and skewness. Clearly, the exact description of homogeneous populations 
is conceptual. These descriptions form useful models, however, to which 
actual populations often approximate. 

The experimenter usually sets up controlled conditions, or describes 
actual conditions, thus limiting his populations to approximate homoge- 
neity. But the populations with which the economist, the sociologist, and 
the crop estimator deal are usually heterogeneous. No one set of para- 
meters describes any extensive group of individuals. Not only do the 
populations change with position and with time, but many populations 
may occupy the same area at the same time. These investigators attempt 
to segregate their various populations into subdivisions or strata , each 
stratum approximating homogeneity. This device of stratification has 
broad implications, some of which will be discussed below. 


17.3 — Sampling from a homogeneous population. The objective 
of sampling consists in the group of inferences which can be made about 
the population sampled. Obviously such inferences will be valuable only 
insofar as the sample is representative of the population. In this book a deal 
of attention has been given to the manner in which the sample is repre- 
sentative — the estimates it furnishes of population parameters and the 
probability statements that may be made about significance and fiducial 
limits. Now we wish to inquire the circumstances which insure represen- 
tativeness. 

People have different ideas of the restrictions imposed by the term, 
representative , because they approach sampling with different objects in 
view. Let us distinguish three objectives. The first is to get a sample 
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which will yield an unbiased estimate of the population mean. This end 
is attained if all members of the population have the same chance of 
being drawn into the sample. As an example we may suppose that there 
is a list of all the inhabitants and that we wish a sample of five per cent 
of them. Starting with any item from the first to the twentieth in the list, 
this and each twentieth item thereafter may be taken to make up the 
sample. To guarantee equal opportunity to all, it is only necessary to 
leave the choice of the initial item to some objective device such as drawing 
a numbered bean from a bag. If the bean marked 13 is drawn, then the 
sample is to consist of items numbered 13, 33, 53, etc. A similar plan is 
the grid often laid down on a city map, every tenth dwelling or family 
along each fifth street, say, being visited. Care must be taken, of course, 
to avoid multiples of the average number of dwellings on a block. With 
obvious minor precautions, samples drawn in such manner furnish the 
estimate desired. Clearly a sample so planned has the virtue of being 
spread uniformly over the list or area. 

Another objective in sampling is that the estimates shall be as accurate 
as possible for the time and money spent. It is possible that a nonrandom 
sampling may accomplish this while a random sample would be less 
accurate. In the hands of experienced investigators with intimate knowl- 
edge of the population a selected sample may give good approximations 
to the population parameters. But the hazards in such sampling are far 
greater than those where chance has free play. One reason is that human 
choice is too often biased, more particularly in respect of variation (10) 
(2). Another is that all probability theory rests on the laws of chance so 
that in default of random selection one cannot safeguard his findings by 
knowledge of significance and fiducial limits. The disinterested scientist, 
wishing to make his samplings as nearly objective as possible, usually 
prefers to avoid all possibility of influencing his findings by selection of 
the individuals upon which they are based, and at the same time to take 
advantage of the internal evidence about variation which is inherent in 
random samples. 

Still a third objective in sampling is one which has been rather per- 
sistently presented in this book, that the sample shall furnish an unbiased 
estimate of variance as well as of the mean, thus making possible tests of 
significance and fiducial estimates. To reach this objective we may 
specify that, in case samples of size n have been decided upon, every pos- 
sible sample of n individuals shall have the same chance of being drawn. 
This condition is clearly not met in taking every twentieth item from a 
long list because there are only 20 such samples available out of the multi- 
tude possible. The condition is met, however, by drawing at random n 
individuals from the population. The identity of the two concepts is 
shown as follows: imagine drawing samples of three from a population 
with six beans, A, B , . . . F. The theory of probability as developed in 
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any text on algebra leads to the conclusion that there are 20 different 
samples which are possible, 


ABC 

BCD 

CDE 

ABD 

BCE 

CDF 

ABE 

BCF 

CEF 

ABF 

BDE 


ACD 

BDF 


ACE 

BEF 



ACF 

ABE 

ADF 

AEF 

The probability of drawing any one of these samples is 0.05. Having 
drawn at random three beans without replacement, you find yourself in 
possession of some one of these triplets, thus satisfying the requirement 
of equal likelihood for all samples. Replacement is not necessary to insure 
equal chance to every sample. If it were, one would not be able to divide 
a group of pigs at random into lots of n. 

Care is necessary in describing the population being sampled, and in 
affording equal chance of selection to every individual or to every sample. 
A sample drawn in random manner from a list of telephone subscribers is 
representative of the population of subscribers whose names are included 
in the list, but is unlikely to be representative of voters, of taxpayers, or of 
all inhabitants. People responding to a mailed questionnaire usually 
constitute a selective group among those receiving it: those who from 
habit or from oversight do not respond cannot be said to have any chance 
at all of appearing in the sample despite the fact that the questionnaire 
was sent to them. For the same reason, housewives who are not found at 
home by an interviewer must be called upon till found if a representative 
sample is to be got. These examples illustrate the pitfalls that must be 
avoided in sampling. 

EXAMPLE 17.1 — If a sample of 10 is drawn without replacement from a limited 
population of 100 individuals, is the probability of being drawn the same for every 
individual? 

EXAMPLE 17.2 — How many different samples of four individuals can be drawn 
from a population of 10? What is the probability of drawing one of them? 

17.4 — Size of sample. The designer of a sampling investigation is 
always confronted with a decision about the size of the sample. While 
resources of money and personnel often set limits which must not be 
passed, there is usually opportunity to take advantage of any basis of 
judgment that may be available. 

Investigators are often content with samples less than 0.1% of the 
population, while a sample of 10% is usually considered large. Such loose 
statements can be replaced by more definite ones only if there is available 
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some knowledge of the mean and standard deviation of the population 
to be sampled. Although this knowledge may not be precise it enables us 
to make fair approximations to sample size. 

We sometimes start with information about the standard deviation of a 
previous sample of n. If the sample was large, there may be little lack of 
precision in assuming that another sample will produce the same s. Take x 
as the estimate of m in the proposed sample, then the fiducial limits are 
specified by 

x — m 

s/Vn 

Solving for n, 

n = t 2 s 2 /(x - m) 2 

We may now ask this question: how large a sample will be required to fix 
the fiducial limits at l units from the sample mean, where / = 2 — m? 

An example is found in an experimental sample to estimate the yield 
per acre of wheat in certain districts of North Dakota in 1938 (4). The 
variance of 222 fields was 90.3. For so large a sample, t = 2.6 may be 
taken as a safe approximation to the 1% level. Suppose we wish to say, 
after the sample is taken, that the probability is 0.99 that the true mean is 
within / = 2 bushels per acre of the sample mean. Then 

* = (2.6) 2 (90.3)/2 2 = 153 fields 

is a guide to the required sample size. 

In the absence of knowledge about the variance of an earlier sample, 
estimates may be made in the manner of sections 5.7 and 5.8. One may 
know, for example, that the percentage of farm land in crops in a certain 
region varies from 30 to 80. From this it may be inferred that 

j = (80 — 30) / 6 — 8%, and hence, s 2 = 64 

How many farms in this region should be included in a random sample to 
get an estimate of percentage farm land in crops within 1 point of the 
population mean? 

n = (2.6) 2 (64)/l 2 = 433 farms 

If it is the coefficient of variation about which you have knowledge, 
the fiducial interval should be expressed as a percentage of the mean. The 
question may be asked: what sample size may be expected to indicate m 
not further away from x than p% of x? That is 

x — m — px/ 100 
Substituting in the formula for n, 


n = (100 ) 2 t 2 s 2 /p 2 x 2 
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But the coefficient of variation is given by 

C 2 = (100 )V/& 

whence, 

n = t 2 C 2 /p 2 

As an example, if it is known that the coefficient of variation for carcass 
weight of swine in a certain class is C = 5%, we may inquire the number 
of randomly selected carcasses necessary to approximate the carcass mean 
within p = 3%: 

n = (2.6) 2 5 2 /3 2 = 19 carcasses 

In such estimates, 2.6 is suggested for / as a safeguard against eventual 
sampling variation. The actual sample may turn up with unusual mean 
and standard deviation. To be on the safe side, somewhat larger samples 
than those calculated above should be drawn, especially if the indicated 
size is small. However, if you can risk a little less accuracy than you hope 
for, you may use the estimates of n with considerable confidence. 

17.5 — Sampling from small populations. There are instances in 
which the sample comprises 5% or more of some limited population, as 
when percentage of farm land in corn is sampled from a county with a 
section of land as the sampling unit. For example, suppose a sample of 
n — 64 section percentages has been drawn at random from a county 
whose population of such units is N = 576. The standard deviation of 
the sample calculated in the usual way is, say, s = 10%. It seems reason- 
able that the possibilities of variation in sample means are more and more 
restricted as the sample size approaches that of the population. The 
formula for standard error appropriate for means drawn from a finite 
population is modified to (3) 



which reduces to the usual form as JV increases toward infinity. Substi- 
tuting s = 10, n = 64, N = 576, 

5^ - (>.25K°.«3) - U8%, 

instead of the value, 1.25%, which would have been calculated for the 
same sample from an unlimited population. 

Adjustment should also be made in the binomial formula of section 16.2, 

= Vpq/n, 

if objects are drawn without replacement from a limited group. Suppose 
50 families in a community of 500 are asked by telephone if they are 
listening to a certain radio program, 10 replying “yes.” Then, p = 0.2, 
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q 5 = 0.8, and the best estimate of standard error of the mean proportion 
p = 0 . 2 , is 

' - \/7 -y/> - jf ” ^/ «>•<»“> ^ - »-<> 53 . 

instead of \/0.0032 = 0.057. 

EXAMPLE 17.3 — If a sample of four from the 16 townships of a county has the 
standard deviation 45, show that the standard deviation of the mean is 19.5. 

EXAMPLE 17.4 — Following is the analysis of variance for an assumed sample of 
Iowa corn condition figures: 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

District 

8 

39,638 

40,161 

4,954 8 

Condition within district 

171 

234.86 


Assuming samples of equal size and equal variance within the several districts, that is, 
f i » 234.86, show that a sample of some 102 per district will be required to restrict the 
5% fiducial limits to one unit (1%) condition on either side of the mean. 


17.6 — Sampling from two or more homogeneous populations. This 
problem has been considered in chapters 4, 6, 7, 9, and all following 
chapters. Estimates of the mean and variance of each sampled population 
were provided, together with tests of significance. In all of the foregoing 
we have assumed that the members of one population could be distin- 
guished from those of another and that each population could be sampled 
independently. There remain some questions which we have not had 
occasion to consider heretofore. 

Before proceeding, it may be wise to call attention to a prevalent 
fallacy. It is rather a common idea that always increased precision is 
attainable through larger sample size, since s± = s/y/n. What is often 
overlooked is that this law applies only to sampling from a single homoge- 
neous population. If the process of sampling carries you beyond the 
bounds of one population you may find that precision actually decreases 
as the sampling continues. It is to avoid this chance that experimental 
plots are grouped into blocks of uniform fertility, each block constituting 
a small population separately sampled. Similar devices are available in 
all circumstances where heterogeneity can be foreseen, as will be clear 
from the pages to follow. It is only necessary here to clarify thinking about 
the advantages of large samples. Since uniform experimental material 
is often limited in amount, several small samples, each from a homoge- 
neous portion, usually yield more precise information than a single large 
one. 

There is another fallacy about sample size sometimes observed in 
reports of sampling. An investigator may be confronted by a choice be- 
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tween securing a large number of interviews in a limited area where 
travel is cheap, or being reconciled to a smaller sample scattered at random 
over the entire region about which information is desired. The latter is 
the only way to insure equal chances of selection for every individual or 
sample in the population. Instances are on record where the entire sam- 
pling was confined to a selected area, such as a township, every indi- 
vidual in the chosen area being enumerated. While this may insure 
complete knowledge of the facts in the township, it concurrently insures 
complete ignorance of the facts in the remainder of the region. Repre- 
sentative samples must be scattered at random over the entire population 
even if only a small proportion of the individuals can be enumerated. A 
small sample properly designed contains information up to the limits of 
its size, but even a large sample confined to a portion of the population is 
devoid of information about the excluded portions. 

In designing a sampling procedure involving a heterogeneous aggre- 
gate or universe , all available information can be used. Neyman has shown 
(6) that if the investigator knows the boundaries of the several homoge- 
neous populations (subdivisions or strata), together with the size N and 
standard deviation, <r, of each, the most efficient sampling consists in 
taking at random k sampling units from each subdivision so that k is 
jointly proportional to N and <r. Suppose, for example, that corn yield per 
acre is normally distributed in each of three type-of-farming areas, as 
follows: 


; 

Area 

Number of Acres 
of Corn 

N 

Standard 

Deviation 

<r 

Product 

JVcr 

Size of Sample 
k = No {n/S) 

1 

10,000 

13 

130,000 

338 

2 

20,000 

10 

200,000 

519 

3 

5,000 

11 

55,000 

143 


T * 35,000 S = 385,000 n - 1,000 


The total sample, n , should then be divided into three parts proportional 
to the three products, JVcr. To do this, add the three values of Nv , calling 
the sum, S. That is, S — 385,000. Decide upon the size of the entire 
sample, say n = 1,000. Multiply the quotient, 

n/S = 1,000/385,000 = 0.0025974, 

by each value of JVcr to get the corresponding k. For example, in area 1, 
k = (130,000) (0.0025974) = 338 

The three values of k should, of course, add to the chosen n = 1,000. 

While exact knowledge of variation within the subdivisions is unusual, 
approximations of sufficient accuracy are often available. Previous 
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samplings or census data may be pertinent. A small experimental sample 
might be resorted to. Even general information about range may serve 
to approximate a (sections 5.7 and 5.8). 

One often has knowledge of the sizes of proposed subdivisions even 
if the standard deviations are unknown. If so, it is usually well to make the 
k * s proportional to the Jf s. The investigation may not be so efficient as if 
the standard deviations were known, but will be as efficient as possible 
with thedimited information available. 

If there is no information about either N or a, one should ordinarily 
make the sampling completely random, subdivision being inappropriate. 

17.7 — Analysis of data from two or more populations. We now 

suppose that the survey is completed and that the results are to be ex- 
amined statistically. The methods are in the main familiar. For sim- 
plicity, let us consider first the case requiring no new devices at all, that 
in which the subdivisions are equal in size and in variance, only their 
means being different. The sample will then consist of equal numbers of 
items in the several strata. In table 17.1 is presented a summary of the 

TABLE 17.1 

Analysis of Variance of Results of a Random Sample of 100 From Each of 



Three Subdivisions 

Population of each subdivision, 10,000 



Source of Variation 

Degrees of 
Freedom 

Sum of 

Squares 

Mean 

Square 


Total 

Subdivisions 

299 

2 

14,755 

1,600 

800 


Individuals 

297 

13,155 

44 29 



£ * 71.21, 

j - 6.655 




results of such a sampling, the total sample of 300 equally divided among 
the three subdivisions. 

Clearly the means of the subdivisions differed significantly so that the 
stratification is justified. In section 17.9 we shall evaluate the effect of 
the subdivision more precisely. Since the strata are known to be equal in 
size, the mean of the sample, 71.21, as well as the variance, 44.29, are 
unbiased estimates of the corresponding parameters of the universe of 
three populations. The variance of the mean is, therefore, 

44.29/300 = 0.1476, 

the standard error being 0.3842. For the 5% fiducial limits (section 3.8) 
we have d.f. = 297, = 1.968 and 

t MSi = * (1.968) (0.3842) - =*=0.76 
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If we say, now, that the mean of the universe lies between 71.21 — 0.76 
* 70.45 and 71.21 + 0.76 = 71.97, or that our sampling has led to a 
mean which is within 0.76/71.21 = 1.1% of the true value, we shall be 
right unless a 1 in 20 chance has come off. 

Ordinarily it is the universe total which must be estimated, 

(30,000)(71.21) - 2,136,000, 

with fiducial limits 30,000 times those of the mean, 2,114,000 and 
2,159,000. 

Having looked at the simplest problem in stratification, let us turn to 
the more realistic one in which the sizes of the subdivisions along with 
their means and their variances are all different. Results of a survey of 
this kind, such as might have been got from the procedure designed in 
the preceding section, arc displayed in table 17.2. Numbers of acres in 


TABLE 17.2 

Analysis of Variance and Summary of the Data From a Random Sample of 
Corn Yields in Three Type-of-farming Areas 


Source of Variation 

Degrees of 
Freedom 

Sum of 

Squares 

Mean 

Square 

Individual areas: 

1 

337 

55,689 

165 25 

2 

518 

54,742 

105 68 

3 

142 

17,044 

120 03 

Within areas 

997 

127,475 

127 85 

Type-of-farming areas 

2 

3,787 

1,893.50 




Mean Yield 

Stratum number 

Total Acres 

N 

Sample Size 
k 

(bu. per acre) 

X 

1 

10,000 

338 

42 10 

2 

20,000 

519 

44 92 

3 

5,000 

143 

39 61 

Total 

35,000 

1,000 



the three areas are assumed to be accurately known. 

The estimated mean of the population is a weighted mean of the sub- 
division means, the weights being the subdivision sizes: 

weighted mean = (SNx)/SN 

Using the data of the example in hand, 

(10, 000)(42.10) + (20,000) (44.92) + (5,000)(39.61) 

= 43.35 


10,000 + 20,000 + 5,000 
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bushels per acre. From the method of calculation, you see how important 
it is to have exact knowledge of the s, for even small errors in these 
weights may introduce noticeable bias in the estimate of the mean. 

For ease in computation the weights may be reduced to proportional 
small numbers, such as 10, 20, and 5 (or 2, 4, and 1) in the example 
above; or to decimal fractions such as 2/7 = 0.28571, 4/7 = 0.57143, 
and 1/7 = 0.14286, the sum of these decimal weights being unity. The 
latter is usually the easier method. Calculation then is confined to the sum 
of products, easily run on a machine: 

(0.28571)(42.10) + (0.57143)(44.92) + (0.1 4286) (39.61) = 43.36 


bushels per acre. 

The estimated variance of the universe mean is also a weighted average 
of the variances of the several subdivision means, the weights being the 
squares of those for the mean: 

variance = (SJW / k) / (67V) 2 = SwW/k, 
where w stands for weight and Sw = 1. In the numerical example, 

(0.28571) 2 + (0.571 43) 2 + (0.14286) 2 = 0.12353 


338 


519 


143 


From the variance we calculate the standard error of the mean, 0.3515 
bushels per acre. 

Since the subdivision variances are not pooled in the manner of 
chapters 4 and 10, but are combined into a weighted average, one cannot 
assign degrees of freedom to the result. It follows that the ratio, x/s 4 , is 
not distributed as /, so that exact statements about fiducial limits cannot 
be made. Nevertheless, since the subsamples are large, the normal distri- 
bution (/ = infinity) can be used with little lack of precision. With this 
qualification we may say that the 5% fiducial limits are 

43.35 =* (0.3515)(1.962), 

that is, 42.66 and 44.04. 

As before, the mean and fiducial limits, expressed as bushels per acre, 
may be expanded to production figures by multiplication: 

Total: (35,000)(43.35) = 1,517,000 bushels 
Limits: 1,493,000 to 1,541,000 bushels 

Let us pause to look at the formula (3) for the variance of the weighted 
mean of several subsamples drawn from populations of limited size 
(section 17.5), 

S[N(N - k)s 2 /k]/(SM) 2 

If the samples comprise 5% or more of the populations, this formula 
should be substituted for the simpler one given above. If weights, w> are 
used in the calculations, and if iSzc; = 1, the formula just stated reduces to 
S(w*s*/k) - (Sws*) / 7*, 
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where T = SJV, the total for the universe. The second term constitutes 
a correction to be subtracted from the variance, S(w 1 s 2 t /k) — 0.12353, 
already calculated. Since, in the corn yield data, 

Sws* = (0.28571) (1 65.25) + . . . + (0.14286)(120.03) = 124.75, 
the correction is 124.75/35,000 = 0.00356. The resulting variance, 
0.12353 - 0.00356 = 0.11994, 

differs from the first in the third significant figure although the samples 
are only from 2% to 4% of the population sizes. 

As is clear from the formulas which were used for weighted mean and 
its variance these estimates may be appreciably biased if the N * s are not 
accurately known. Gain in precision expected from stratification and 
large samples may be offset by poorly determined stratum sizes; in fact, 
serious loss of precision may occur. Cochran (3) gives a quantity by which 
the variance of the weighted mean is increased (as a first approximation) 
if there is some information as to the variance, % 2 , associated with the TV’s: 

s[(x - **)V]/(M0 2 = $[($ ~ ogv] 

where x w is the weighted mean calculated above and sj is the variance 
of a weight. 

If we assume for illustration that the w's (or the jV’s) are determined 
with coefficients of variability each 5%, we have the three standard 
deviations 0.014286, 0.028572, and 0.007143 acres with corresponding 
variances as set down in the table below. Also, x w = 43.35 bushels per 
acre. Substituting these values with the three stratum means, x , found in 
table 17.2, the correction is calculated so: 


Subdivision 

£ — x w 

(s - xj 1 


1 i 

-1 25 

1 562 

0 0002041 

2 

1 57 

2 465 

0 0008164 

3 

-3.74 

13 988 

0 0000510 


Thus the correction to be added to 0.12353 is the sum of the products in 
the last two columns = 0.00304. This represents a decrease in information 
(section 10.12) of 2.4% attributable to errors in the weights. If the gain 
due to stratification were large (because of large differences, x — x w ) the 
loss from inaccurately determined weights would be correspondingly 
augmented. 

Where knowledge of strata is vague, a well conducted sampling over 
the universe may be safer than attempted subdivision. How may that be 
handled? To illustrate the method, let us suppose that a random sample 
of 1,000 items has been drawn from the universe of type-of-farming areas 
which we have been studying. For purposes of comparing results, let us 
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assume that by a lucky coincidence the mean has turned out to be the 
same as the weighted mean of the three subsamples in table 17.2, 

(338) (42.10) + (519)(44.92) + (143)(39.61) „ ^ L , 

= 43.21 bushels per acre 

338 + 519 + 143 K 

while the variance, by an equally happy chance, is, as before, 127.85. 
Since these are now taken to be the statistics computed from a random 
sample of 1,000, with no stratification, they are thereby unbiased estimates 
of the parameters The standard error of the mean is 

\/l27. 85/1, 000 = 0.3575 bushels per acre 

To expand the sample mean to a production total for the universe, 
some estimate of the number of acres is required. Let us assume that this 
estimate is 35,000, but that now it is not accurately known. We have, 
then, the estimate of total production, 

(35,000) (43.21) = 1,512,000 bushels 

If we arc to arrive at an appropriate variance for this total we must 
first get some idea of the variance of the estimated size of the universe. 
This is likely not known with any precision. However, people familiar 
with the situation may venture the statement that 35,000 is probably 
within 1,000 of the correct number. If this is interpreted as meaning that 
the fiducial limits are 35,000 =*= 1,000, a fair estimate of s , using t = 2, is 
s = 500. This reasoning is presented as one way of arriving at an approxi- 
mate value of s. If more reliable information is to be had, use it. 

If the squares of coefficients of variation are used in the calculation, 
the variance of a product of uncorrelated factors is, to a first approxima- 
tion, the sum of their variances (7) (8). We, therefore, express our esti- 
mates of variance as follows: 

Yield per acre: (100) 2 (0.12786)/(43.21) 2 = 0.6848 
Acres: (100) 2 (500) 2 /(35,000) 2 = 2.0408 

The estimate of the variance of production in the three districts is, then, 

0.6848 + 2.0408 = 2.7256, 

the standard error being 1.6509%. Hence, the estimate of production 
has the standard error, 

(1,51 2,000) (0.01 6509) = 24,960 bushels 

While an exact statement of fiducial probability would be inappropriate, 
we may set the analogous limits, 

1,512,000 =*= 2(24,960), 

that is, 1,462,000 and 1,562,000 bushels, and think of them as approxima- 
tions — provided our estimate of the variance of population total was good. 
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The larger spread of these limits, compared to those set before, reflects 
the less accurate information which we are now assuming. 

Those two percentage estimates of variance, 0.6848 and 2.0408, are 
informative. That of acre yield adds little to the total. Consequently, 
only slight precision can be gained for the production figure by refine- 
ments in sampling yield. If much narrower limits are to be set, it is the 
estimate of total acres that must be improved. 

If an attribute is being sampled, and if it is assumed to be binomially 
distributed within the several strata, the means, p , and variances, pq/n , 
are substituted for x and s 2 in the formulas for estimating the mean and 
its standard error in the universe. As an example, consider a sample of 
692 families in Iowa to determine, among other things, how many had 
victory gardens in 1943. The data were as follows: 


Stratum 

Number of 
Families 

Weight, 

w 

Number in 
Sample, 
n 

Number 

With 

Gardens 

Percentage 

With 

Gardens 

Urban 

312,393 

0.4451 

300 

218 

72.7 

Rural 

161,077 

0 2297 

155 

147 

94 8 

Farm 

228,354 

0.3252 

237 

229 

96.6 


701,824 

1 0000 

692 

594 



The numbers of families were taken from the 1940 census. Since there 
was no information about variance, the sample was allotted approximately 
in proportion to the number of families per stratum, a sample of 1 per 
1,000 being aimed at. 

The weighted mean percentage of Iowa families having gardens was 
estimated as, 

Spw = (72.7) (0.4451) + (94.8)(0.2297) + (96.6) (0.3252) « 85.5% 
The variance of this mean: 

Sw 2 pq/n = (0.4451) 2 (72.7)(27.3)/300 + etc. = 1.62 

The standard error, then, is 1.27%. 

If approximately normal distribution is assumed in these large samples, 
one may set fiducial limits subject to the same reservations as before: 

85.5 ± (1 .96) (1 .27) : 83.0% and 88.0% 

17.8 — The structure of sampling investigations. In setting forth 
the randomized block and latin square designs of chapter 11, nothing 
was said about the possibility of sampling the plots instead of harvesting 
them completely. (For an example with references, see 9.) Each investi- 
gator, with little added expense, may plan his experiment so as to get 
information about the efficacy of this procedure. While we shall illustrate 
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EXAMPLE 17.5 — The following estimates of cotton yield (pounds per acre) as ot 
December 1, 1939, were reported by correspondents and furnished through courtesy 
6f Samuel T. Marsh (5), Agricultural Statistician, U.S.D.A., Knoxville, Tennessee. 


District 

Weight, 

w 

Number 

Reports, 

k 

Degrees 

of 

Freedom 

Sum of 
Squares 

Mean 

Square, 

M 

Mean 
Pounds 
Per Acre 

1 

0.359 

77 

76 

566,614 , 

7,455 

343 7 

2 

0.466 

184 

183 

908,046 

4,962 

233.2 

3 

0 076 

40 

39 

128,145 

3,286 

188.2 

4 

0.060 

31 

30 

169,994 

5,666 

266 4 

5 

0.016 

5 

4 

31,330 

7,833 

287.0 

6 

0 023 

9 

8 

89,800 

11,225 

265.0 


1.000 

346 

340 

1,893,929 

5,570.4 



Compute the weighted state mean, 273.03 pounds per acre, and its variance 20.527. 


EXAMPLE 17.6 — The analysis of variance of the Tennessee cotton yield estimates 
is this: 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Reports from same district 

340 

1,893,929 

5,570.4 

Districts 

5 

880,084 

176,017 

Total 

345 

2,774,013 



Verify this analysis approximately by use of the data in example 17.5. 


EXAMPLE 17.7 — It is thought that the weights in the cotton data are not in error 
as much as 2%. Assume, then, that their standard deviations are one sixth of the 
ranges, 0.04a;. Show that this error in the weights decreases precision by less than 1 %. 


the method by use of a field plot layout for maize, all the principles apply 
directly to samplings of social or economic facts. The analogy will be 
pointed out later. (See also, 6 page 614.) 

Our data are derived from a cooperative project of the United States 
Department of Agriculture and the Iowa Agricultural Experiment Station, 
the purpose being to study the development of the corn plant in relation 
to its environment. The layout was randomized blocks with four varieties 
replicated in five plots. Much of the growing material was cut during the 
summer for dry matter increase and chemical studies. Reserved for yield 
determination were eight sampling units in each plot. These units constitute 
the random elements in the sampling, their variation being the basis of 
sampling error. Shape, size and placement of the sampling units arc a 
major problem in design. In the corn experiment, each consisted of four 
adjacent hills at the vertices of a rectangle. These hills, harvested sepa- 
rately, are the units of measurement , systematically placed in the sampling 
units. 

Sampling units sometimes are not randomized over the whole plot but 
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are grouped into two or more sampling areas (subdivisions or strata). This 
precaution, often referred to as local control , insures more uniform coverage 
of the plot by the sampling units, preventing the possibility of their con- 
centration in some one area. Variation in fertility over the plot thus 
tends to be equalized in the sampling. For the kind of study we are 
making there must be at least two sampling units in each sampling area. 
The entire structure of the experiment is set out in table 17.3. 


TABLE 17.3 

Analysis of Variance of Sampling Data From a Randomized Block Field 
Experiment With Maize. Ames, Iowa, 1939. Individual Hill Basis 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Variety 

3 

6 0723 


Block 

4 

1 7006 


Experimental error 

12 

1 6930 

0 1411 

Subdivisions within plots 

20 

2 9479 

0 1474 

Sampling units within 
subdivision 

120 

14 4387 

0 1203 

Hills within sampling units 

480 

38 9200 

0 0811 

Total 

639 

65 7725 



1 sampling unit = 4 hills 

1 subdivision = 4 sampling units = 16 hills 

1 plot = 2 subdivisions — 32 hills 

Entire experiment = 20 plots = 640 hills 


The twenty whole plots constitute a randomized block experiment of the 
kind discussed in chapter 11. Here we are concerned not with tests of 
significance but rather with the composition of the experimental error — the 
possibilities of changing it by alterations in the design of the sampling. 

A summary of the calculations may be convenient — they were ex- 
plained more fully in section 10.9. After computation of the total sum of 
squares for the 640 hills, the steps are as follows, the Scime correction 
term being used throughout, (i) Compute the sum of squares for the 160 
sampling units, each consisting of 4 hills, then deduct this from the total, 
leaving the sum of squares for hills within sampling units, 38.9200; (ii) 
compute the sum of squares for the 40 subdivisions, each containing 16 
hills, and deduct this from the sum of squares for the 160 sampling units, 
leaving 14.4387, the sum of squares for sampling units within subdivisions; 
(iii) compute the sum of squares for the 20 plots, each having 32 hills, and 
subtract this from the sum of squares for the 40 subdivisions, the remainder, 
2.9479, being designated as subdivisions within plots; (iv) the whole-plot 
analysis is then completed in the usual manner. 

In table 17.3, the comparison that chiefly concerns us is that between 
the two mean squares representing random variation, the sampling and 
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experimental errors. The latter was examined rather carefully in section 
1 1.3: it represents the variation of a set of deviations from regression values, 
assumed to be a randomly assorted lot from a normal population 'With 
mean, zero, and standard deviation, <r. Let us imagine the source of one 
of these deviations. If it is positive, the corresponding yield is more than 
was estimated for the plot, situated in its block and planted to its variety. 
Why? Perhaps in fertility it was particularly favored among plots of the 
same block. Perhaps also it was planted with seed that produced plants 
more vigorous than average. These advantages it might enjoy whether 
the harvesting was complete or by samples. But if only a sample is taken, 
the deviation may contain another component due to the random selection 
of an unusually productive batch of sampling units. Thus, each deviation 
from expected is made up of a portion peculiar to the plot and a second 
portion due to sampling within the plot. The variances of the two portions, 
being independent, are to be added. 

In table 17.4 are copied the pertinent data about sampling and experi- 
mental errors. The variances of the means of sampling units and plots are 


TABLE 17.4 

Sampling and Experimental Errors From Table 17.3, Together With 
Components of Variance 


Source of 
Variation 

Mean 

Square 

Number of 
Units of 
Measurement 

Variance 
of Means 

Variance Is an 
Estimate of 

Experimental 

0 1411 

32 

0.004409 


Sampling 

0 1203 

4 

0 030075 



J-s 2 - 

0 030075 = 

0.000650 



calculated by dividing the corresponding mean squares by the respective 
numbers of hills (units of measurement). The sample estimates of the 
variance components are 

s s 2 = 0.030075 

j* 2 = 0.004409 - (0.030075) /8 = 0.000650 

It should be observed that local control (that is, stratification) was 
quite effective in eliminating variation from experimental error. In all 
our discussion of this experiment, we are assuming that sampling will be 
confined to these same strata , so that differences between them do not affect 
our estimates of error. 

The sample estimate of the variance of a plot mean may now be written, 
Si 2 = sj? + sf/k = 0.000650 + 0.030075/*, 

where k is the number of sampling units per plot. For the experiment as 
designed, * = 8, and 


= 0.000650 + 0.003759 
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To verify this, notice that the sum is 0.004409 as in the table. 

It is now evident that the major portion of the variance of the plot 
mean has its source in the sampling of the plots. The implication is 
clear: to increase the precision of an experiment like this, alter the design 
so as to include more sampling units per half-plot (stratum). Suppose, for 
example, 24 sampling units were taken from each subdivision, then 
= 0.000650 + 0.030075/48 
= 0.000650 + 0.000627 = 0.001277 
Thus, the relative information from this new design would be 

1/0.001277 0.004409 _ 

1/0.004409 “ 0.001277 “ 3A 

times that in the original, an increase of 245%. (See example 17.10 for 
some formulas involving “information.”) But the increased information 
would be got at the expense of 6 times as much harvested crop. The 
efficiency of an experiment depends on the information per unit cost. 
Ordinarily, the entire crop is harvested anyhow, so that little extra cost 
is incurred from getting all the information. But in the experiment under 
consideration this was not possible because the other hills had been re- 
moved earlier for dry matter determinations. 

For sake of illustration, let us now suppose that the variance of the hills 
within our sampling units is an unbiased estimate of the variance of hills 
selected at random. It probably is not, but is controlled in some manner 
by the systematic arrangement of the hills in the sampling units. Whether 
and how it would be affected by changing the pattern is a matter of con- 
jecture. Nevertheless, by assuming that the hill variance would be un- 
altered if the size of the sampling unit were varied we can show how ran- 
dom data might be used for further study of the design. In table 17.5, the 
components of variance are estimated under this new hypothesis, 

TABLE 17.5 

Components of Variance for Table 17.3 on the Assumption of Random 
Selection of the Hills 


Source of 
Variation 

Mean 

Square 

Units of 
Measurement 
Hills 

Variance 
of Means 

Variance Is an 
Estimate of 

Experimental error 

0 1411 

32 

0.004409 

<r V 3 2 -f as 7 / 8 -f- og* 

Sampling error 

0 1203 

4 

0 030075 

a 1 ! 4 -f os * 

Units 

0 0811 

1 

0 0811 

a* 

<r a 

- 0.0811 

as' = 0.009800 

ag' = 0.0006500 


and we can write, 

Si 2 = V + Ss 2 k s /k s + s 2 /k B 

= 0.000650 + 0.009800/8 + 0.0811/32 
= 0.000650 + 0.001225 + 0.002534, 



Section 17.8 


Design and Analysis of Samplings 


471 


the sum being 0.004409 as before. The k’s are the numbers of hills per 
sampling unit and plot, respectively: k s /k B is the number of sampling 
units per plot. 

As before, an increase in the numbers of hills and sampling units is 
indicated. If the number of hills were quadrupled and the number of 
sampling units doubled, 

sf = 0.000650 + 0.009800/16 + 0.0811/128 

= 0.000650 + 0.000612 + 0.000634 = 0.001896, 

the relative information now being 0.004409/0.001896 = 2.33 times the 
original amount, an increase of 133%, but at the expense of 4 times as 
many hills. 

A more realistic question would be, “What would be the effect of 
decreasing the hills per sampling unit and increasing the number of sam- 
pling units proportionately, making no change in the amount of harvested 
material?” Take, for example, only 2 hills per sampling unit, allowing 
8 sampling units per stratum: 

=>.000650 + 0.00980/16 + 0.0811/32 
= , 0.000650 + 0.000612 + 0.002534 = 0.003796, 

an increase of 16% over the original information with no increase of 
expense except the inconvenience of locating the additional sampling 
units. 

Is anything gained by sampling a plot rather than harvesting it en- 
tirely? This is the moot question as to sample versus census. Clearly, 
precision is at the maximum (sampling error = 0) if a census is taken. If 
there is no increase in cost of harvesting, in threshing losses, etc., then the 
complete harvest is obviously more efficient. 

It should be noted that in the experiment being discussed there was 
little opportunity for changing the design — there was no further material 
available for sampling. Aside from the slight advantage of scattering the 
sampling units more widely (decreasing the number of hills in each) the 
only way to have gotten more information was to have increased the 
number of replications. 

Let us look at another analysis of variance (table 17.6)^ not uncommon, 
in which experimental error is much greater than sampling error. Here 
s s 2 is 0.01664 with s 2 / 8 only 0.0010. The reader may verify the fact 
that in these circumstances doubling the number of sampling units will 
increase information less than 3%, whereas an increase in replication 
from 5 blocks to 6 will make a corresponding increase of 1/5 = 20%. 
This emphasizes the fact that the sample-census problem should be ex- 
amined anew in each situation where the answer is worthwhile. One 
object of this chapter is to set out some of the more useful methods. 

In the social sciences, the sampling unit may comprise a family, the 
households along a city block, a township, a section of farm land, etc., 
the corresponding unit of measurement being perhaps the individual, 
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TABLE 17.6 

Analysis of Variance to Illustrate Situation in Which Sampling Error Is Small 
Compared to Experimental Error. The Structure of the Experiment Is the 
Same as That of Table 17.3 


Source of Variation 

Degrees of 
Freedom 

Mean 

Square 

Number 
Sampling 
Units, k 

Components 
of Variance 

Variance 
of Mean 

Variety 

3 

2 0241 




Block 

4 

0 4252 




Experimental error 

12 

0 1411 

8 

= 0 01664 

0.01764 

Sampling error 

1 120 

0 0080 

1 

jt « 0.00800 



the family, the section, or the farm. It is the sampling unit that must be 
randomly selected. Its units of measurement form a fixed pattern. The 
entire population is subdivided (or stratified) into sampling areas like 
cities, income levels, or type-of-farming areas within which the population 
under investigation is supposed to be homogeneous. 

Investigators who can spare the time and money to carry through the 
extra details of sampling may be well rewarded if they discover that 
inexpensive changes in design result in material increases in efficiency. 

EXAMPLE 17.fr — This is a modified form of the analysis of variance of the data 
collected in a wheat sampling survey (4) of 5 districts in North Dakota (1938): 


Source of Variation 

Degrees of 
Freedom 

Mean 

Square 

Number of 
Samples, k 

Districts 

4 

329 

88 

Fields within district 

215 

65 

2 

Samples within field 

220 

19 

1 


Two samples were taken at random in each of 44 fields randomly selected in each of 
the 5 districts. Calculate the components of variance. Ans j/> 2 = 3, sy 1 *= 23, s* ■= 19. 

EXAMPLE 17.9 — What is the relative information to be expected from each of the 
following arrangements? (i) double the sampling within each field, (n) double the 
number of fields; (iii) take only one sample from each of double number of fields; and 
(iv) take 4 samples from each of half the number of fields. Ans. 1.03, 1.11, 1.08, 
and 0.88. 

EXAMPLE 17.10 — If so 1 Is the original error variance and s* that resulting from 
some change in design, show by use of the definition in section 10.12 that: 

(i) Relative information == j 0 *A 2 

(ii) Increase in information = (joV-r 8 ) ~ 1 

(i) may be greater than 1 or less, so that (ii) may be positive or negative. Both quanti- 
ties may be expressed in percent by moving the decimal points 2 places to the right. 

EXAMPLE 17.11 — An experiment is being designed to test the nutritive value of 4 
feed supplements. The weight of 5-week chicks is the variate to be measured. Ackerson 
and Mussehl (1) reported the average weights and standard deviations of 5-week 
single comb white Leghorns: 

Males: weight, 260 grains; standard deviation, 58 grams. 

Females: weight, 242 grams; standard deviation, 49 grams. 
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It is sufficiently accurate to take the pooled bird variance as (58*-+-49 a )/2 « 2,880. 
The randomized blocks design is to be used with 6 replications and 10 chicks per cage, 
so that the analysis of variance is 

Treatments 3 

Replications 5 

Error 15 s* -f- 10 j** 

Chicks within cages 216 s 2 

Assume the component due to environmental differences among the cages is 
10% of 2,880 =« 288 * sg< 2 . The error mean square will then be 5,760. What differ- 
ence between two treatments would be significant at the 5% level? Ans. 30 grams. 
Calculate the coefficient of variation, assuming the average weight of all chicks to be 
250 grams. Ans. 30%. If greater precision is desired, would you double the number 
of chicks per pen or increase the replications to 8? Re-read section 11.10; or better, 
sex the chicks and design the experiment accordingly. 

17.9 — Effectiveness of stratification. After examination of a stratified 
random sample, doubt may arise as to the advantage of the design or 
interest may be excited as to its efficiency. Especially if other samplings 
of the same universe are planned, it is desirable to learn if there is any 
advantage of the stratified sample over one taken completely at random. 

An easy way to estimate the variance of a random sample from the 
results of a stratified sample is by constructing the analysis of variance of 
the sampled universe and from it calculating the total variance (mean 
square). The data of table 17.1 may be used to illustrate the case of 
equal populations with the same sized samples from them. Since s 2 = 44.29, 
the component for subdivision means is 

JV 2 = (800 - 44.29)/100 = 7.5571 

With these components, an analysis of variance for the universe is built 
up in table 17.7. The mean square for subdivisions is 

s 2 + Ns m 2 = 44.29 + (10,000) (7.5571) = 75,615, 

TABLE 17.7 

Analysis of Variance of Universe Constructed From Estimates of <r* and as 1 
Data from Tabic 17.1 



Degrees of 

Sum of 

Mean 

Source of Variation 

Freedom 

Squares 

Square 

Units within strata 

29,997 

1,328,600 

44.29 

Strata (or subdivisions) 

2 

151,200 

75,615. 

Totals 

29,999 

1,479,800 

49.33 


N = 10,000 being the size of each population. The first two mean squares 
in the table, multiplied by the corresponding degrees of freedom for the 
universe, yield sums of squares that add to the total. Finally, the mean 
square, 49.33, is the desired estimate of the variance in the unstratified 
universe. 

It can now be said: (i) That the information in the stratified sample is 
49.33/44.29 - 1 = 0.114 or 11.4% 



474 


Statistical Methods 


Section 17.9 


greater than that in the random; or (ii) that the information lost by aban- 
donment of stratification would be 


1 - 44.29/49.33 = 0.102 or 10.2% 

Since the first of the foregoing statements is the one usually interesting 
to samplers, a formula may be more convenient than construction of the 
analysis of variance. The gain in information due to stratification is 

•A r (m - 1)% 2 
(Nm - l)x J ’ 


m being the number of strata. Substituting, this gain in our example is 


(10,000) (3 - 1 ) (7.5571) 
(30,000 - 1) (44.29) 


= 11.4%, as above. 


Since Nm is often large, as here, it is then adequate to write the formula, 

N(m - 1 )s M 2 _ m - 1 sj/ 

Nms 2 ~~ m ' s 2 


The present data entered in this formula give the same result as before. 
These formulas apply only to strata of equal size. 

The solution of the problem with strata of varying size parallels that 
just given. The data in table 17.2 will be used as illustration. 

To calculate s M 2 , the formula for k 0 (section 10.8) is required: 


ko 


^rj( si ~ f ) “ K 1 ' 000 + 


404,054 \ 

1,000 / 


297.97 


From this, 

s M 2 = (1,893.50 - 127.85)/297.97 = 5.9256 


We shall require, also, an average of the stratum sizes, for which the 
same as above formula is used: 


No = 


m 



35,000 


525,000,000 \ 
35,000 / 


10,000 


Next, the analysis of variance in table 17.8 is constructed. For the 
sums of squares in the several strata, the mean squares of table 17.2 are 
multiplied by the degrees of freedom for the populations. The average 
mean square within the strata is the weighted mean, 


Jp 2 = 4,365,800/34,997 = 124.75 


The mean square for the stratum means is calculated on the assumption 
that it has the components, s M 2 = 5.9256 and j> 2 — 124.75; then 

J > 2 + Nos m 2 = 124.75 + (10,000)(5.9256) = 59,381 
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TABLE 17.8 

Analysis of Variance of Universe Constructed From Estimates in Table 17.2 


Source of Variation 

Degrees of 
Freedom 

Sum of 
Squares 

Mean 

Square 

Individual strata 

1 

9,999 

1,652,300 

165.25 

2 

19,999 

2,113,500 

105.68 

3 

4,999 

600,000 

120.03 

Within strata 

34,997 

4,365,800 

124.75 - sp > 

Strata 

2 

118,800 

59,381 

Total 

34,999 

4,484,600 

128.14 


is the mean square to be multiplied by dj . = 2 to get the stratum sum of 
squares, 118,800. 

Finally, the total sum of squares divided by dj. = 34,999 gives the 
mean square, 128.14, an estimate of the variance of a sample randomized 
over the universe, stratification ignored. 

We are now able to calculate the variance of the mean of the random 
sample, 128.14/1,000 = 0.12814, and to compare it with that of the strati- 
fied sample, 0.12353 (section 17.7). The increase in information due to 
stratification is 

(0.12814/0.12353) - 1 = 0.037 or 3.7% 

Two conclusions may be drawn, (i) Although the differences among 
strata are highly significant (table 17.2) the advantage to be gained by 
stratification of this sample is scarcely worthwhile if it costs much. That 
is because the differences among the stratum means are slight, (ii) In- 
accuracies in the weights might easily nullify the gain from stratification. 

The construction of the analysis of variance may be avoided by use of 
the following formula for the information in a stratified sample relative to 
that in the random: 

[1 + 1/(7* - m)][SwM - SM/T] + (1 - Sufis J 
(1 - 1 /T)(Sk)(Sw*M/k) 

T = SN = 35,000, the sum of the sizes of the m strata; w refers to the 
weights of the strata (0.28571, 0.57143, and 0.14286 of section 17.7); 
M stands for the mean squares of the several strata (165.25, 105.68, and 
120.03 of table 17.2); Sk = 1,000, the total sample size; and Sw 2 M/k 
= 0.12353, the variance of the mean of the stratified sample. Substitu- 
tion of the various quantities reduces the relative information to 1.037, 
the increase due to stratification being 3.7% as before. 

If T is large and m small the factors, [1 + 1 /{T — m)] and 1 — 1 /T> 
are each so close to unity as to be negligible. 
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EXAMPLE 17.12 — Show that N 0 may be computed from the formula, 

N q « T( 1 - Sw*)/{m - 1) 

In the Tennessee cotton data of example 17.5, T was 707,000 acres. Calculate 
No « 91,026 acres. 

EXAMPLE 17.13 — From the analysis of the sample of cotton yields of example 
17.6, calculate ko » 44.662 and su* — 3,816.4. 


EXAMPLE 17.14 — Compute the increase in information due to stratification in 
the cotton data, noticing SuPM/k = 20.527 in example 17.5. Ans. 18.5%. 


EXAMPLE 17.15 — Show that the weighted mean of the population (strata) mean 
squares is given by, 


sp 2 


TSwM - SM 
T - m ’ 


which reduces to SwM if 7' is large compared to m and SM. For the cotton, sp 1 «* 5,961 .78 


EXAMPLE 17.16 — Derive the formula for relative information by following the 
steps described for Table 17.8. 


EXAMPLE 17.17 — Using the weights and variances of the 6 districts in example 
17.5, design a sampling in which the k ’ s are proportional to N<r and Sk « 346. Ans. 
k « 140, 149, 20, 20, 6, and 11. 
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Meng, C.J., 271, 316, 336, 338 
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equation, 323 
linear, 107 
multiple, 340 
polynomial 

2nd degree, 379 
orthogonal, 388 
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Salmon, S. C., 46, 53 
Sample 
large, 169 

normally distributed, 55 
random, 2, 9, 42, 49, 57, 267, 455 
of means, 57 

from a normal distribution, 57 
representative, 1, 454 
small, 38, 175 
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Smith, F. B., 82, 88, 246, 252, 275, 316 
Smith, Stewart N., 79, 88 
Smoothing, 374 

Snedecor, George W., Ill, 126, 137, 141, 
168, 206, 213, 219, 235, 252, 262, 
295, 317, 324, 338, 339, 350, 359, 373 
Snedecor, James, 24, 30 
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of gi and gty 175, 177 
of mean, r*, 41, 61 
pooled estimate of, 76, 215 
of regression coefficient, 119, 349, 368 


Standard deviation — Continued 
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Switch-back experiment, 421 
Symbolical summary of computations 
in analysis of variance, 219, 231, 234, 
256, 261 

Symmetry, 55, 64, 147, 174, 433 
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discrete or discontinuous, 40, 440 
independent, 105 

Variation or variability, 1, 9, 15, 31, 40, 
58, 67, 68, 70 

coefficient of; see Coefficient 
see also Standard deviation and Vari- 
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x, mean value of X , 34 

Y, dependent variable, 105 
regression F, 106 

y, deviation of F from mean, 109 
#, deviation of f from 110 

y , mean of F, 109 
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